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ABSTRACT 

In this paper, we measure the optical-to-virial velocity ratios V^pt/V2ooc of 
disk galaxies in the Sloan Digital Sky Survey (SDSS) at a mean redshift of 
{z) — 0.07 and with stellar masses IO^Mq < < IQ^^Mq. Vopt/^^200ci the 
Q ratio of the circular velocity measured at the virial radius of the dark matter 

halo (~150 kpc) to that at the optical radius of the disk ('-^lO kpc), is a 
powerful observational constraint on disk galaxy formation. It links galaxies 
d to their dark matter haloes dynamically and constrains the total mass profile of 

disk galaxies over an order of magnitude in length scale. For this measurement, 
we combine Vopt derived from the Tully-Fisher relation (TFR) from Reyes et al. 
with V200C derived from halo masses measured with galaxy-galaxy lensing. In 
— anticipation of this combination, we use similarly-selected galaxy samples for 

both the TFR and lensing analysis. For three bins with lensing-weighted 

T— I mean stellar masses of 0.6, 2.7, and 6.5 x lO^^M©, we find halo-to-stellar mass 

ratios M2ooc/M^ = 41, 23, and 26, with la statistical uncertainties of around 

,3 0.1 dex, and yopt/V"2ooc = 1.27±0.08, 1.39±0.06, 1.27±0.08 (Icr), respectively 

Our results suggest that the dark matter and baryonic contributions to the 
mass within the optical radius are comparable, if the dark matter halo profile 

T-H has not been significantly modified by baryons. The results obtained in this 

^ work will serve as inputs to and constraints on disk galaxy formation models, 

. ^ which will be explored in future work. Finally, we note that this paper presents 

a new and improved galaxy shape catalogue for weak lensing that covers the 



U fuU SDSS DR7 footprint. 



Key words: galaxies: spiral - galaxies: kinematics and dynamics - gravita- 
tional lensing: weak. 



1 INTRODUCTION 

The basic picture of disk galaxy formation has long been 
established: both gas and dark matter acquire angular 
momentum through tidal torques in the early Universe 

* rreyesakicp.uchicago.edu 



Peebles||l969 1; the gas then cools and collapses into a 
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rotationally-supported disk at the center of a cold dark 
matter ha lo ([White fc R ees"1978 
T980. Blumenthal et al.l|1984 Da 



Fall & Efstathiou 



canton et al.||1997| 



Mo et al. 



19981. Today, many aspects of the theory 



are still not completely understood, such as star for- 
mation, feedback from supernovae and active galactic 
nuclei (AGNs), angular momentum transfer, mergers, 
and the response of the dark matter halo to the in- 
fall of baryons. State-of-the-art hydrodynamic simula- 
tions, which incorporate effective prescriptions for some 
of these physical processes, are starting to produce in- 
dividual disk galaxies with luminosities, sizes, and rota- 



tion curves comparable with observed ones (Governato 



|et al.|2010||Guedes et al.|2011||Agertz et al.|2011[ ). The 

ultimate test for these simulations is their ability to sat- 
isfy a broad set of observational constraints. In addition, 
observations can serve as a guide to improve the effec- 
tive models used in the simulations. Careful comparison 
between the theoretical models and observations will 
become increasingly important as the simulations con- 
tinue to improve, and eventually produce cosmological 
ensembles of galaxies. 

A powerful observational constraint on disk galaxy 
formation is the dynamical link between disk galax- 
ies and their dark matter haloes, T4pt/V2ooc, the ratio 
of the circular velocity measured at the optical radius 
('"opt ~ 10 kpc, a few optical disk scale lengths) to that 
at the virial radius of the dark matter halo (rvir ~ 150 
kpc). Because the optical-to- virial velocity ratio is a 
dynamical quantity, it does not suffer from uncertain- 
ties in stellar mass estimates from photometry (or spec- 
troscopy), which are subject to uncertainties in dust ex- 
tinction, stellar populations, and the stellar initial mass 
function (IMF) (see, e.g., |Conroy et al.|2009t . 

The optical-to- virial velocity ratio is sensitive to the 
total mass profile within the halo virial radius. For ex- 
ample, an isothermal profile (with a mass density pro- 
portional to r~^) corresponds to K)pt/V2ooc = 1. The 
shape of the halo mass profile, without any modification 
from their interaction with baryons, is reasonably well- 



understood from A^-body simulations (Navarro et al 



1996 1997 Gao et al. 20081. On the other hand, the 



amount of response of the dark matter halo to the in- 
fall of baryons during their collapse, as well as to the 
blowing out of baryons due to feedback effects, is poorly- 
understood and is the subject of active debate in the lit- 
erature, both from the theoretical (jGnedin et al 
Gustafsson ct al 



2004 



2006; Romano-Dfaz ct a T:]|2008| |Jo-| 
hansson ct al..2009 . Duffy et al.,2010,|Abadi et al.|2010| 
Pcdrosa ct al.|2010||Tissera et al.|2010|" Governato et al.| 



2010(|Guedes et al.|20lT]|Gnedin et al.|2011|) and obser- 



vational sides ( Sand et al.j2002 Humphrey et al.|2006 
Zappacosta et al.|200 6 Dutto n et al.|2007 Schulz et al.| 



2010 



Auger et al.||2010i |Dutton et al.||2011[ ). Different 



formation scenarios will generally predict different val- 
ues for T4pt/V200c, which can be directly tested against 
observations. 

The optical-to-virial velocity ratio can be mea- 
sured by combining two measurements. First, Vopt can 



be measured directly for individual galaxies from Ha 
(or HI) rotation curves. Moreover, there is a well- 
established tight relation between Vo-pt and stellar mass 
M« , referred to hereafter as the TuUy-Fisher relation or 



TFR (TuUy & Fisher 19771. Second, the average K 



can be measured for large galaxy samples with galaxy- 
galaxy lensing or satellite kinematics. 

Previously, |Seljak| (|2002j) combined early TFR 
and galaxy-galaxy lensing measurements and inferred 
V^)pt/V2ooc = 1.8 with a 2a lower limit of 1.4, for L* late- 
type galaxies (here, Vvir = V'200c is the circular velocity 
of the dark matter halo at the virial radius rvir = r2ooc, 
the radius within which the mean density is 200 times 
the critical density of the Universe today) . They found 
this result to be consistent with the prediction of the 
standard model of adiabatic contraction of the dark 



matter halo due to baryonic infall (Blumenthal et al. 



19861. More recently, Dutton et al. (20101 combined a 
TFR based on data from [Pizagno et al. ( |2007[ ) with 



published halo-to-stellar mass ratios in the literature 
from galaxy-galaxy lensing, satellite kinematics, and 
halo abundance matching, and found Vopt/V2ooc — 1, 
for disk galaxies with M* ~ lO^^-lO^M©. This result 
is lower than the prediction from the standard adiabatic 
contraction model and suggests that the effect of adia- 
batic contraction is weaker or that the opposite effect 
occurs (i.e., the dark matter halo density within the op- 
tical radius decreases instead of increases). 

In this work, we perform a new measurement of 
K)pt/V200c. We use stacked weak lensing measurements 
of ~10* disk galaxies from the Sloan Digital Sky Survey 



SDSS; York et al. 20001, with well-defined photome- 



try and available fibre spectroscopy, to measure aver- 
age halo virial masses M200C. We combine these mea- 



surements with the TFR derived in Reyes et al. (2011 



hereafter Rll) from a sample of 189 disk galaxies with 
measured Ha rotation curves that is, by construction, 
a fair subsample of the lens sample used in this work. 
Our results constrain the relation between the optical- 
to-virial velocity ratio Vopt/V2ooc and stellar mass M,, 
of disk galaxies with a mean redshift of 0.07 and stellar 
masses M* ~ 10^-10^^ M©. Unlike in previous analyses, 
we use similarly-selected galaxy samples and consistent 
definitions in both the lensing and TFR measurements 
to enable a fair combination of the two. 

This paper introduces a new source galaxy cata- 
logue for lensing, which is demonstrably an improve- 
ment, both in area coverage and quality, over the one in- 
troduced in |Mandelbaum et al.| ( [2005a[ hereafter M05). 
We describe the catalogue properties, various tests of 
systematics, and calibration of the lensing signal in 
Sec. [4] the reader who is not interested in the technical 
details may skip this section. The generation procedure 
for the catalogue is described in Appendix A and the 
differences from the catalogue in M05 are enumerated 
and discussed in Appendix B. 

The organization of the rest of the paper is as fol- 
lows: in Sec. [2] we discuss the method of this work 
and details of the lensing calculations; in Sec. [3] we de- 
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scribe SDSS data and the selection of our lens sample; in 
Sec. [4] we extensively characterize the source catalogue 
and shape measurements used in this work; in Sec. [5] 
we present the measured lensing signals, and the results 
of various systematics tests; in Sec. |6j we describe fits 
to the lensing signal to derive average halo masses; in 
Sec.[7j we present our main results and their interpreta- 
tion; finally, in Sec. [8] we present a summary and discuss 
plans for future work. 

For the calculations performed in this work, we 
adopt the cosmology: fl^ — 0.27, = 0.73, as = 0.8, 
h = _ffo/(100km s~^Mpc~^) = 0.7. Unless otherwise 
stated, quantities already include the appropriate fac- 
tors of h: stellar masses M* scale as h~^, halo masses 
M200C scale as , and distances R scale as . All 
distances are expressed in comoving units, unless other- 
wise noted. 



2 METHOD 

We begin with an overview of the methodology of this 



work (Sec. 2.11. In the rest of the section, we describe 
galaxy-galaxy lensing theory (Sec. 2.21, the calculation 
of the galaxy-galaxy lensing signal (Sec. |2.3| l, and the 
derivation of the TFR from Rll used in this work 



(Sec. 2.41 



2.1 Overview 

In this work, we constrain two relations: (i) the rela- 
tion between the halo-to-stellar mass ratio M2ooc/M, 
and stellar mass M, (or HSMR), and (ii) the relation 
between the optical-to-virial velocity ratio 'K)pt/V2ooc 
and M, (or OVVR). Note that the virial quantities are 
less tightly-constrained than the corresponding "opti- 
cal" quantities. 

To determine the HSMR, we measure the galaxy- 
galaxy lensing signal around stacked galaxies in bins of 
stellar mass M, . We perform fits to the observed lensing 
signals to determine the best-fitting halo-to-stellar mass 
ratio for each stellar mass bin, as described in Sec.j6l 
We also adopt a functional form for the HSMR (Eq. [iS] 
of Sec. 6.2.2 1 and fit the lensing signal for all three bins 



simultaneously to determine the best-fitting parameters 
in the relation. 

To determine the OVVR, we convert the halo virial 
masses M200C in the HSMR into halo virial velocities 
V200C = \/ GM2ooc/»'2ooc (where G is the Newtonian 
gravitational constant), then take its ratio with the rela- 
tion between V^pt and M« from the TFR in Rll. Details 
of these derivations, as well as the results, will be pre- 
sented in Secs. [6T2] fc[7l 



2.2 Lensing theory 

Galaxy-galaxy lensing is the deflection of light from 
sources by the mass in intervening lenses, which shows 
up as a coherent tangential shearing effect. It provides 



a simple way to probe the connection between galaxies 
and matter via their cross-correlation function 



?gm(r) = {S!,{x)6m(x + r}) 



(1) 



where 5g and 5ni are overdensities of galaxies and mat- 
ter, respectively. This cross-correlation can be related 
to the projected surface density 



E(i?) 



■/[ 



1 + 



(2) 



(where = + x^)- For our purposes we neglect 
the radial window function, which is of order 100 Mpc 
broad, well beyond the scales that are important in this 
work. The surface density E(_R) is then related to the 
observable quantity for lensing. 



AE(7?) = 7t(i?)E, = E(< R) - E(i?), 



(3) 



where 7t is the tangential shear. In practice, we truncate 
the integral in Eq. l2|at l/i~^Mpc, which is well beyond 
the halo virial radii (defined in Eq. [Tg] below) of the 
galaxies we study. 

The second relation in Eq. [3] is true only in the 
weak lensing limit, for a matter distribution that is ax- 
isymmetric along the line of sight. This symmetry is 
naturally achieved by our procedure of stacking many 
galaxies and determining their average lensing signal. 
This observable quantity can be expressed as the prod- 
uct of the tangential shear 74 and a geometric factor 



4nG AAs(l + 2i)2 



(4) 



where Di and Ds are angular diameter distances to the 
lens and source, Dis is the angular diameter distance 
between the lens and source, and the factor of {l + zi)~'^ 
arises due to our use of comoving coordinates. For a 
given lens redshift, E~^ rises from zero at Zs = zi to an 
asymptotic value at Zs 3> 21; that asymptotic value is 
an increasing function of lens redshift. 

2.3 Lensing signal calculation 

Calculation of the galaxy-galaxy lensing signal requires 
us to identify pairs of lens and source galaxies within 
some physical separation on the sky. Since we have spec- 
troscopic redshifts for our lens galaxies, we can work 
with physical tangential separations rather than in an- 
gular coordinates (which mixes physical scales for lens 
galaxies at different redshifts). To compute the average 
lensing signal AE(J?), lens-source pairs are first assigned 
weights according to the error on the shape measure- 
ment via 



■Wis 



(Is). 



(5) 



where is the estimated shape error per component 
and asN is the intrinsic shape noise per component, 
which was determined as a function of magnitude in 
M05, figure 3 (in Sec. 4.3.2 of this paper we will reassess 



the accuracy of that shape noise estimate). The factor 
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of (Ec'"')"^ converts the shape noise in the denominator 
to a noise in AE; it downweights pairs that are close in 
redshift. 

Once we have computed these weights, we calculate 
the lensing signal in 23 logarithmic radial bins from 0.02 
to 2/i~^Mpc as a summation over lens-source pairs via: 



AE(i?) 



(6) 



where the factor of 2 arises due to our definition of 
ellipticity and TZ is the shear responsivity, which de- 
scribes how the ellipticity estimator used in this paper 
(Eq. |8| responds to a shear ( Bernstein & Jarvis 2002 
and Sec. |4.3.2 of this work). 

There are several additional procedures that must 
be done when computing the signal (see M05 for de- 
tails). First, the signal computed around random points 
must be subtracted from the signal around real lenses 
to eliminate contributions from systematic shear. The 
measured signal around random points is consistent 
with zero over the range of length scales used in this 



work (c.f Sec. |5. 2.1 1. 

Second, the signal must be boosted, i.e., multiplied 
by B{R) — 7i(_R)/nrand(-R), ratio of the weighted num- 
ber density of sources around real lenses, relative to 
the weighted number density of sources around random 
points, in order to account for the dilution of the lensing 
signal due to sources that are physically associated with 
a lens, and therefore not lensed. The multiplication by 
the boost factor means that our shear estimator is es- 
sentially identical to the globally normalized estimator 
in 



Rozo et al. (20111 



To determine errors on the lensing signal and 
boost factors, we divide the survey area into 200 
bootstrap subregions]^ and generate 500 bootstrap- 
resampled datasets. For illustration, we rebin the sig- 
nal into 7 radial bins and plot the re-binned signal. The 
computed lensing signals, for the real and random galax- 
ies, as well as the boost factors, will be presented in 
Sec.lSl 



2.4 Derivation of the TFR 

By construction, the galaxy sample used to derive the 
TFR in Rll is a fair subsample of the lens sample used 
in this work (defined in Sec. 3.2 below and sec. 3 of 



Rll). There is only one difference in the selection crite- 
ria used: an axis ratio cut {b/a < 0.6) has been applied 
to the TFR sample, but not to the lens sample. We 
found that applying such a cut only modestly changes 
the distributions of basic galaxy properties, such as stel- 
lar mass and galaxy colour (c.f. fig. 1 of Rll), so we do 



^ Ideally we would like contiguous, equal-area subregions. 
Given the SDSS survey geometry, we are forced to compro- 
mise slightly, and go in the direction of requiring strictly 
equal-area regions while allowing a small fraction to be non- 
contiguous. 



not expect it to introduce a significant bias between the 
two samples. Applying the axis ratio cut to the lens 
sample would have decreased the sample size by almost 
half, so we have chosen not to do so. 

In this work, we follow the recommendation of Rll 
and use the TFR between stellar mass M, and optical 
velocity Vopt ~ Vso, the disk rotation velocity at the 
radius enclosing 80 per cent of the i-band galaxy light, 

RsoB 

Here (and throughout t his wo rk) , stellar masses M, 
correspond to the Kroupa ( |2002| IMF[^ As defined in 
sec. 5 of Rll, they are determined from SDSS i-band ab- 
solute magnitudes and g — r colours (both uncorrected 
for internal dust extinction), using stellar mass-to-light 
ratio estimates from |Bell et al.| ( |2003| ) (reduced by 0.15 
dex to account for the difference in the normalization of 
the IMF). Absolute magnitudes Mi are based on Pet- 
rosian apparent magnitudes and galaxy colours g — r 
are based on model apparent magnitudes, described in 



Stoughton et al. ( 2002 1 and Abazajian et al. ( 2004 1. Ab 



solute magnitudes and colours used to determine M, 
were corrected for Galactic extinction using the dust 
maps of Schlegel et al. ( 1998 land fc-corrected to z = 



using the kcorrect product version v4_l_4 of IBlanton] 
& Roweis (20071. We propagate errors from Mi and 
<; — r to determine the error in M* ; for the TFR sample, 
the mean statistical uncertainty in M* (at fixed Kroupa 
IMF) is 0.041 dex. Rll found the best-fitting TFR to 
be 

logl/opt = (2.142 ± 0.004) + (0.278 ±0.010) 

X (logAf, - 10.102), (7) 

with an intrinsic scatter of 0.036 ± 0.005 dex and a 
total measured scatter of 0.056 dex in logV^pt. 



^ Rll showed that Af* yields a TFR with smaller scatter 
than single-band optical luminosities (in ugriz), and simi- 
larly, Vgo performs better than alternative definitions of Vopt , 
such as V2.2 — evaluated at 2.2 times the disk scale length — 
or Vc — the asymptotic circular velocity. The common choice 
for Vopt is V2.2, the rotation velocity at 2.2 times the disk 
scale length R^. We refer the reader to sees. 4 and 5 of Rll 
for the derivation of radii R^ and Rgo (from bulge-disk de- 
composition fits) and of rotation velocities Vgo S'Ud V2.2 (from 
arctangent fits to the measured rotation curves) .Rll showed 
that Rgo is more likely to sample the fiat part of the disk ro- 
tation curve, and as a consequence, Vgo yields a tighter TFR 
than V2.2 (also see Pizagno et al. 2007). Moreover, Rgg is not 
afi'ected by degeneracies involved in bulge-disk decomposi- 
tion fits, unlike R^. The choice of rotation velocity definition 
is important when comparing results from different works, as 
we do later in Sec. 17.31 

^ The normalization or the Kroupa IMF is 0.3 dex lower than 
that for a Salpeter IMF with a lower mass cut-off of 0.1 Mq 
and 0.05 dex higher than that for a Chabrier IMF. 
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3 LENS SAMPLE 

First, we briefly describe SDSS imaging and spec- 
troscopy that we use for both our lens and source galaxy 
samples (Sec. 



3.1 1. Then, we describe the selection of our 



lens sample (Sec. 3.2 1 



3.1 SDSS data 



The SDSS ( [York et al. 12000] ) imaged roughly n steradi- 
ans of the sky, and followed up approximately one mil- 



lion of the detected objects spectroscopically ( Eisenstein 
|et al.|[200T| [Richards et~aL][2002{ [Strauss et al.||2002[ ). 
The imaging was carried out by drift-scanning the sky 



in photometric conditions (iHogg et al.|2001 


Ivezic et al. 


2004[|, in five bands (ugriz) (Fukugita et al. 


1996[ [Smith 


et al.|2002 1 using a specially-designed wide-field camera 



(Gunn et al. 19981. All of the data were processed by 
completely automated pipelines that detect and mea- 
sure photometric properties of objects, and astrometri- 
cally calibrate the data ||Lupton et al.[|200l" Pier et al. 



2003] [Tucker et all2006| . The SDSS I/II imag ing sur- 



veys were completed with a seventh data release ( Abaza- 
jian et aL|[2009 l, though this work will rely as well on 



an improved data reduction pipeline (Photo v5_6) and 



updated photometric calibration (ubercalibration, Pad- 
manabhan et al.|2008 1 that was part of the eighth data 



releas e, from SDSS-III ( |Aihara et "al][2011| |Eisenstein[ 



20111 



Objects are targeted for spectroscopy using the 
imaging data ( [Blanton et al.[[2003a[ ). Main galaxy sam- 
ple targets are selected as described by [Strauss et al.| 
( 2002 1 . The Main galaxy sample target selection in- 
cludes a Petrosian ( [1976[ ) apparent magnitude cut of 
rp = 17.77 mag, with slight variation in this cut across 
the survey area. Targets are observed with a double 320- 
fibre spectrograph on the same telescope ( [Gunn et al.[ 



20061 



Specific subsets of the Main spectroscopic galaxy 
sample, to be described in Sec. |3.2[ will be used as the 
lens galaxies for the lensing analysis described in this 
work. The new source catalogue used here, derived from 
the SDSS imaging data, is described in Sec. |4] 

3.2 Disk lens sample 

We need to select a disk galaxy sample that is adequate 
in size and at sufficiently high redshift for the lensing sig- 
nal to be measurable, given the relatively shallow SDSS 
imaging. Rll defined a parent disk sample adequate for 
this purpose, selected from the SDSS DR7 NYU- Value 
Added Galaxy Catalog (VAGC; [Blanton et al.|2005[ ). In 
this work, we define a lens sample that is made up of a 
majority (76 per cent) of the parent disk sample defined 
in Rll, as described below. 

We begin with a brief summary of the selection cri- 
teria used to define the parent disk sample (we refer the 
reader to sec. 3 of Rll for details). First, galaxies were 
selected to have redshifts 0.02 < z < 0.10 and absolute 



magnitudes —22.5 < Mr < —18.0 mag (before internal 
extinction correction). Then, star- forming galaxies were 
selected by imposing a lower limit on the Ha emission- 
line flux observed through the SDSS spectroscopic fibre, 
as well as mild cuts in Sersic index and emission line ra- 
tios (to exclude active galaxies). Applying these cuts 
yields a sample of 175 920 galaxies]^ 

To select our lens sample from this parent disk sam- 
ple, we first remove galaxies in areas of the sky where 
there are no available source galaxies. This removes 
around 10 per cent of the parent disk sample, leaving 
a total of 158 735 galaxies. We note that a substantial 
percentage of galaxies were removed because of the very 
stringent data quality cuts applied to the source galaxy 
sample (c.f. Appendix A), and not because of a lack of 
data or extreme spatial variation in the source number 
density. 

Second, we aim to construct a lens sample that is 
dominated by central and isolated galaxies so that the 
observed lensing profiles will be simpler to interpret and 



analyse (as described in Sec. 6.11. To do this, we iden- 
tify those galaxies that are most likely to be satellites 
and exclude them from the sample. For each galaxy, we 
count the number of brighter neighbors A''bright within a 
fixed physical transverse radius of 1.14 Mpc and a red- 
shift width of Az = 0.006 (following [Reid et al.[|2009 l. 
Then, we scale A''bright by the number of brighter neigh- 
bors of a random galaxy with the same luminosity and 
redshift We identify "satellite galaxies" as those that 
have a scaled number count greater than or equal to 7. 
This cut excludes 15 per cent of the remaining galaxies 
and yields a disk lens sample of 133 598 galaxies. 

We note that this average satellite fraction is consis- 



tent with Mandelbaum et al. (2006b I, who found using 



halo occupation modelling that typically 10-15 per cent 
of blue/late-type galaocies were satellites. Moreover, we 
flnd, reassuringly, that the satellite fraction we calculate 
is very weakly dependent on redshift, and dependent on 
luminosity in the sense that brighter galaxies are more 
likely to be satellites (as expected since they are more 
strongly biased than fainter galaxies). Finally, we note 
that we apply the same criteria on the random galaxy 
catalogue used in the lensing analysis; the satellite cut 
excludes only 2.4 per cent of the random galaxies. 



The sample presented in Rll had 169 563 galaxies (or 
3.6 per cent fewer). Here, we liave updated the sample to 
include galaxies that had been incorrectly excluded due to 
a failure in the runs of the emission line fitting code. These 
failures were not related to any of the galaxy properties, so 
the addition of these galaxies does not introduce any biases 
to the sample. 

^ For both the real and random galaxy samples, we use the 
same comparison galaxy sample (namely, the DR7 SDSS 
NYU- VAGC). 
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4 SOURCE AND SHAPE CATALOGUE 

This work introduces a new source galaxy catalogue that 
is meant to be an improvement, both in area cover- 
age and quality, over the one introduced in M05 and 
used for subsequent science papers. This catalogue, like 
that from M05, utilises a method of PSF-correction 



known as re-Gaussianization (Hirata & Seljak 20031 



Re-Gaussianization is a method based on the use of the 
moments of the image and of the PSF to correct for the 
effects of the PSF on the galaxy shapes. However, un- 
like many other moments-based corrections, it includes 
corrections for the non-Gaussianity of the galaxy profile 
( Bernstein fc Jarvis|2002| [Hirata fc Selj ak' 2003^ and of 
the PSF (to first order in the PSF non-Gaussianity). 

Details about how the catalogue was generated, and 
an explicit contrast with the catalogue from M05 are in 
Appendices [X] and [B] respectively. A description of its 
properties, and systematics tests, will be presented in 
the following subsections (Sees. 4.1 4.51. 



4.1 New catalogue properties 

The new shape catalogue covers an area of 9 243 deg^, 
with 39 267 029 unique galaxy detections in the SDSS 
DR8 area passing all cuts on photometry, shape mea- 
surements, and photo- 2: described in Appendix [A] (or 
an average source number density of 1.2 arcmin^ ). For 
this work, we use a subset of that area (7 131 deg^) 
corresponding to the DR7 lens catalogue used for this 
work. 

The relevant areas are shown in Fig. [l] the major- 
ity of the "ratty" areas result from imposition of the cut 
on r-band extinction Ar < 0.2 mag, since there are re- 
gions that are close to that limiting value and that get 
shredded by this cut. For science work that might be 
dominated by those reprocessed version of the 

catalogue might be necessary to ensure better coverage. 
However, for the lens catalogue used for this work, those 
areas are not necessary. 

Before describing the catalogue properties, we be- 
gin by introducing the quantities used to describe each 
galaxy. These include the following: 

(i) The extinction-corrected r-band model magni- 
tude, which is a measure of the total galaxy flux. 

(ii) The photo-z, which is calculated using the Zurich 
Extragalactic Bayesian Redshift Analyzer (ZEBRA 
Feldmann et al.|2006 l. Its usage for SDSS lensing stud- 



(2012 



ies was explored thoroughly by Nakajima et al, 
hereafter Nil). 

(iii) The galaxy spectral energy distribution (SED) 
template corresponding to that photo-z (more infor- 
mation about the templates that were used are in Ap- 
pendix [Xj the templates for galaxies that are used for 
science range from to 20, with zero corresponding to 
early types and 20 to late types). 

(iv) The galaxy resolution factor i?2, which expresses 
how resolved it is compared to the PSF. A given galaxy's 
resolution factor thus depends on the conditions under 
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Figure 1. Area coverage of the new shape catalogue, com- 
pared to the old catalogue from M05 (top) and the lens sam- 
ple used for this work (bottom). 



which it was observed. The detailed definition of reso- 
lution factor is given in Appendix |A] Eq. |A5| for the 
purpose of this section, it suffices to know that R2 ap- 
proaches zero for completely unresolved galaxies, one for 
perfectly resolved galaxies, and we require R2 > 1/3 in 
both r and i bands to avoid excessive systematic errors 
in the galaxy shapes. 

(v) The galaxy shape (61,62) and the estimated 
shape measurement error per component. The shapes 
are rotated to a coordinate system in which positive 

61 corresponds to East- West elongation and positive 

62 corresponds to Northeast-Southwest elongation. Our 
shape definition corresponds to 



1 



for minor-to-major axis ratio q. 



(8) 



Details of the derivation of these quantities are in Ap- 
pendix |A] 

Table [l] gives a summary of the basic catalogue 
properties with respect to these quantities. In addition. 
Fig. [2] shows the histogram of apparent magnitude, res- 
olution factor, photo-z, template, and total ellipticity. 
As shown, the number counts do not rise as steeply as 
for a fiux-limited sample, because of the loss of galaxies 
at the faint end due to both the difficulty in measuring 
shapes at low S/N (the flux limit r = 21.8 corresponds 
to S/N > 9.5) and the difficulty in resolving such faint 
galaxies given the typical SDSS seeing. 

While the photo-z histogram in Fig. [2] does not 
match the true dN/dz as well as one might like, the im- 
pact of the significant photo-z errors, az/ (1+z) = 0.113, 
on galaxy-galaxy lensing measurements has been quan- 
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Figure 2. Each panel shows the histogram of source galaxy properties, derived from a random subsample of 5 per cent of the 
catalogue after imposing all shape and photo-z cuts (~ 2 X 10® galaxies). Top left: Histogram of r-band extinction corrected 
model magnitude. Top right: Same, for resolution factor i?2. Lines are shown for the r and i band resolution factors separately, 
and for min(i?2,ri ^2,i) which will be important when we consider selection biases. Middle left: photo-2 histogram, and the 
inferred true dN/dz from Nil. Middle right: Histogram of template values used for inferring photo-z, where lower values 

correspond to redder/earlier- type galaxies. Bottom: Histogram of total ellipticity value (\J e\ + e^), which includes a significant 
contribution from noise. 
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Overall statistics 

Galaxies passing all cuts 39 267 029 

Fraction that passed shape cuts 0.71 

Total area, deg^ 9 243 

Total area of lens sample for this work, deg'^ 7 131 

Mean number density (gal/arcmin^) 1.18 

RMS ellipticity per component 0.36 



Property 




Percentiles 




Mean 




16 


50 


84 




r-band model magnitude 


19.56 


20.79 


21.47 


20.52 


R2,r 


0.46 


0.63 


0.80 


0.63 


photo-2 


0.15 


0.37 


0.55 


0.36 


z (from Nil) 


0.21 


0.39 


0.62 


0.42 


A, 


0.042 


0.079 


0.136 


0.087 


PSF FWHM (r, arcsec) 


1.03 


1.21 


1.42 


1.23 


PSF FWHM (j, arcsec) 


0.95 


1.14 


1.35 


1.15 


lOO-^-^-fT.ky.^ (nmgy) 


0.040 


0.045 


0.052 


0.046 



Table 1. Basic information about the source catalogue presented in this paper, including overall numbers and area; statistics 
of galaxy properties; and statistics of the observing conditions. 



tified by Nil using a training sample consisting of 
9 631 galaxies, and the effects on the lensing signal cal- 
ibration are well understood. 

Fig. |3] shows the fraction of the galaxies that sat- 
isfy all the cuts we impose on the source catalogue 
(i.e., the magnitude, flag and photo-z cuts listed in Ap- 
pendix A) that have usable shape measurements; the 
dependence on apparent magnitude is fairly strong for 
galaxies fainter than r « 21 mag. 

Fig. [4] shows density contour plots (logarithmically 
spaced, with factors of 2.5 in the density) relating the 
resolution factor R2, shape measurement uncertainty 
(7e, and the photo-z to the r-band apparent magnitude. 
First, we see that the r-band magnitude and the reso- 
lution factor R2 are weakly correlated. Naturally, the r- 
band magnitude and the shape measurement errors are 
significantly (positively) correlated. Finally, the r-band 
magnitude correlates with the photo-z as well because 
fainter objects are more likely to be at high redshift. 

4.2 Dependence on imaging conditions 

Given that the catalogue generation procedure entails 
placing a cut on the resolution of the PSF-convolved 
galaxy image with respect to the PSF, the source num- 
ber density is clearly dependent on the imaging condi- 
tions. 

In order to explore this effect, we carry out several 
tests on stripe 82 (-50 <RA< -1-60, -1.25 <Dec< 1.25 
degrees). Conveniently, this area has enough observa- 
tions at any given point that allows us to make multiple 
independent versions of the source catalogue (using the 
procedure from Appendix |A| with different observing 
conditions. 

4-2.1 Realistic range of conditions 

Here we consider the realistic range of observing con- 
ditions covered by the full source catalogue, includ- 




18 19 20 21 

r magnitude 



Figure 3. Fraction of the galaxies that satisfy all the cuts 
imposed on the source catalogue (i.e., in magnitude, flags, 
and photo-^) that have usable shape measurements. 

ing the seeing FWHM and the relevant combination 
of sky noise and extinction that determines the S/N , 
or 10'''*'*'"CTsky in nanomaggies (nmgyj^ Fig. [5] shows 
histograms of these properties for uniformly distributed 
random points within the physical boundaries of the 
source catalogue, and for the source galaxies, which are 
biased in the direction of better seeing (where the see- 
ing in the two bands used for the shape measurement is 
highly correlated). It is clear from this plot that the sky 
noise level has a minimal effect on the probability of a 
source galaxy being included in the catalogue. 

This unit of flux is defined such that the apparent magni- 
tude m = 22.5 — 2.5 logjQ [flux (nmgy)]. 
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Figure 4. Each panel shows contour plots of the density 
of source galaxies in a different 2d parameter space, fn all 
cases, contour levels are logarithmically spaced by a factor 
of 2.5. Top: Density contour plot for r-band magnitude and 
resolution factor R2,r- Middle: Same, for r-band magnitude 
and the total (band-averaged) shape measurement error a^, 
estimated from the sky variance and simple assumptions in 
Eg. |A7[ Bottom: Same, for r-band magnitude and photo-2. 
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Figure 5. Observing conditions in the entire source cata- 
logue, as compared to that for randomly distributed points 
within the same area coverage. Top: Histogram of i-band 
PSF FWHM for source galaxies (solid) and random points 
(dashed lines). Middle: Same, for the sky noise (standard de- 
viation). Bottom: Density contour plots of r versus i band 
PSF FWHM, with contours that are logarithmically spaced 
by factors of 2.5. 



© 0000 RAS, MNRAS 000, 000-000 



10 Reyes et al. 



4.2.2 Seeing 

Given our finding that the catalogue covers a broad 
range of seeing conditions, which do have a noticeable 
impact on the source number density, we generated in- 
dependent versions of the source catalogue on stripe 82 
using 3 sets of imaging runs with good (run numbers 
2700, 2728, 4263), typical (4128, 4203), and poor seeing 
(250, 3434, 4253, 4288). In aU cases, the sky levels are 
fairly typical for stripe 82, so the difference in PSF is the 
predominant difference between the catalogues. These 
catalogues cover an area of 125 deg^ (-10 <RA< 40 for 
the full width of the stripe) and have identical coverage, 
so any differences in the observed distribution of galaxy 
properties is due to a difference in what we can detect 
given the observing conditions, and not some underlying 
difference in the galaxy populations. 

The histograms of PSF FWHM in the three ver- 
sions of the catalogue are shown in Fig. [6] the me- 
dian r (i) band PSF FWHM values are 0.90" (0.87"), 
1.15"(1.09"), and 1.32"(1.26"), going from best to worst 
seeing. For the three versions of the catalogue, the ob- 
served source number density is 1.25, 1.08, and 1.04 
arcmin"^. 

The distribution of observed galaxy properties is 
also shown in Fig. [6j including histograms of apparent 
magnitude, resolution factor, and photo-z. As shown, 
one impact of poor seeing is the inability to resolve 
shapes preferentially for the fainter galaxies. As a con- 
sequence, in good seeing, the typical source apparent 
magnitude is fainter, indicating that we are sampling 
a different intrinsic galaxy population. This effect was 
also observed in Nil. Moreover, in good seeing the typ- 
ical resolution factor R2 of the sources is higher. While 
this should obviously be the case for an identical sam- 
ple of galaxies, it was not necessarily obvious that this 
would happen since the galaxy populations are intrin- 
sically fainter and smaller in the catalogue that has 
good seeing. Finally, the Zphot distributions only change 
marginally as we go from typical to poor seeing. 

We can therefore conclude that the seeing is a ma- 
jor factor that determines the underlying galaxy popula- 
tion that gets included in the catalogue. However, Fig.|6] 
does not answer the question of whether the observed 
galaxy properties depend on the seeing, or whether the 
errors in the galaxy properties in different observations 
are correlated]^ To answer those questions, we turn to 
a direct comparison of the observed galaxy properties 
for those that appear both in the "best seeing" and the 
"worst seeing" catalogues. Due to noise and the differ- 
ent seeing values, only a subset of these catalogues are 
matched with each other (66 per cent of the poor seeing 
catalogue). Still, we compare the observed magnitudes. 



This could conceivably occur even with two observations 
that have the same seeing; one could imagine that noise fluc- 
tuations might correlate the magnitudes, resolutions, shapes, 
photo-2, and other properties. 



photo-2, resolution factors, and shapes, for the matched 
sample in Fig. [7| 

As shown in Fig [7] the resolution factor R2 is the 
only parameter that clearly varies substantially when 
comparing a matched sample with observations in good 
versus poor seeing, as one might expect given that it 
is determined by a combination of the galaxy size and 
the seeing. The shape, apparent magnitude, and photo- 
z contour plots are quite symmetric about the 1:1 line, 
which is reassuring. However, given the large matched 
sample (> 2 x 10^ galaxies), it should be possible to 
check explicitly for correlations between the different 
observations of these galaxy properties via an estimate 
of the Pearson cross-correlation coefficient Vxy. 

To do so, we calculate the differences between these 
observed quantities (Ar, etc.) and the difference be- 
tween the seeing values in the two observations, and cor- 
relate the differences with each other. The correlation 
coefficients between the various properties are shown in 
Table |2] As shown, aside from the expected significant 
negative correlation {r^y ~ —0.329 ±0.002) between the 
seeing and the resolution factor (larger FWHM results 
in a lower resolution factor), there is another correla- 
tion of similar magnitude, between the resolution factor 
and the apparent magnitude. The sign of this correla- 
tion implies that in the observation in which the galaxy 
appears to be larger (better-resolved) according to the 
second moments, it also appears to be brighter when 
measuring fiux using the model magnitudes. This cor- 
relation is expected, and in the Gaussian case, should 
have magnitude +0.707 ( [Hirata et al.|2004[ ). We do not 
expect exactly this value because the magnitudes are 
obtained via model fits, and there are selection crite- 
ria imposed on these quantities which will change the 
expected correlation. Thus we accept the observed, sig- 
nificantly positive correlation as expected despite its be- 
ing weaker than the naive Gaussian calculation would 
predict. 

There are also several parameter correlations in Ta- 
ble|2]that are small, at the few per cent level, yet statis- 
tically significant given the large sample size. For exam- 
ple, when the seeing is worse, the total ellipticity tends 
to be larger. It is unclear whether this is a true bias 
based on the different observation conditions (there are 
several types of calibration bias that could result from 
the decreased resolution or increased noise, as explained 



in Sec. 4.4 1, or whether it is merely due to the fact that a 



galaxy observed in worse seeing is spread out over more 
pixels and therefore has a noisier shape. 

There is also a small positive correlation between 
noise fluctuations in apparent magnitude and photo-z: 
those galaxies that appear fainter due to noise fluctua- 
tions also appear to be at higher redshift. Given the lack 
of luminosity prior on the photo-2;, it is not completely 
obvious that this correlation should exist. However, it 
may be tentatively attributed to the fact that if the 
galaxy SED that the photo-2 code is trying to use has 
a break in the r band (as for the 2 > 0.4 galaxy pop- 
ulation), and the galaxy seems too faint in r, then the 
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photo- z code will move the break even further into the 
red (i.e., higher redshift) to compensate. 

There are a few conclusions we can draw from the 
calculations in this section. First, the observed galaxy 
properties that do not depend explicitly on the seeing 
(such as the ellipticity, apparent magnitude, and photo- 
z), but that could have acquired some dependence on 
seeing due to issues with data processing, do not have a 
strong dependence on seeing, as one would hope. This 
statement is true even for a fairly significant difference 
in seeing values, from 0.9" to 1.4", which as shown in 
Fig. [5] and Table [T] are around the 16 and 84 percentiles 
of the observing conditions of the source catalogue. The 
second point is that there is no significant correlation 
between noise fiuctuations (or seeing-dependent system- 
atic effects) in the galaxy ellipticity and the photo-z. 
This point is very important, since we would like to 
treat photo-z systematics and shape systematics on the 



lensing signal independently (Sec. 4.4 1, and the results 
in this section confirm the validity of that approach. 



4.3 Systematics tests: shape catalogue 

4-3.1 Tests of individual galaxy shapes 

The first systematics test that we present is a com- 
parison between this and the M05 shape catalogue, for 
galaxies that appear in both. 

For this purpose, we carried out a matching process 
between the two catalogues, using a tolerance of 1". 
Due to the effects of noise on the observed resolutions 
and magnitudes, and the different versions of software 
used to process the two catalogues, the match rate is 
typically 73 per cent. The results of matching are shown 
for a ~ 100 deg'^ patch of sky in Fig. [s] we compare the 
r band magnitudes, the r-band resolution factors, and 
the band-averaged ei (comparison of 62 is similar). 

As shown in the top panel of Fig. [8j in this partic- 
ular patch of the sky, there is a small zero-point offset 
of 0.02 magnitudes in the r band, in addition to some 
scatter resulting from differences in processing. The typ- 
ical zero-point offset between the old and new catalogue 
varies from location to location, because the old cata- 
logue used older, less reliable, and run-dependent pho- 
tometric calibrations than the new catalogue. For nearly 
all of the area covered by both catalogues, the zero-point 
offset is typically several hundredths of a magnitude in 
either direction. 

The middle panel of Fig. |8] compares the r-band 
resolution factors in the M05 and new catalogue. There 
is a clear tendency for galaxies to be estimated as being 
better resolved in the new catalogue than in the old one, 
which is more significant for poorly resolved galaxies. 
This results from one of the bugs in determining the 
PSF for the shape measurement that affected the old 
catalogue (as described in Appendix [B|). The result of 
this problem is that the galaxy selection cut of _R2 > 1/3 
was intrinsically more conservative in the old catalogue 
than in the new one. 



Finally, the bottom panel of Fig. |8] compares the 
ellipticities in the new and M05 catalogue, and reveals 
a slight additive offset and a multiplicative calibration 
difference. Both of these issues are due to the bug (Ap- 
pendix [B| that led to the PSF size being slightly over- 
estimated in the old catalogue. The multiplicative offset 
is such that the galaxy ellipticities in the old catalogue 
are ~ 1.5 per cent larger, because the bug had resulted 
in overestimated sizes for the PSFs, so the dilution cor- 
rections were too large. Fortunately, this 1.5 per cent 
effect is not very large compared to the typical system- 
atic calibration uncertainties (8 per cent) that is consid- 
ered as part of the error budget in the science analyses 
with the old catalogue. The slight additive offset be- 
tween (ei) in the old and new catalogues likewise arises 
due to the PSF size misestimate in the old catalogue, 
which meant that the correction for the PSF ellipticity 
(which is systematically nonzero along the ei direction) 
was also incorrect in the old catalogue. 

We also consider, for galaxies in the new source cat- 
alogue, the comparison between r and i band shapes. 
Since the resolution factor and S/N is not necessarily 
the same in the two bands, we might expect there to 
be some systematic offset even if the light profile is the 
same in the two bands. Moreover, for bulge-|-disk galax- 
ies, we expect the bulge to be more prominent in i band 
than in r, which might lead to generally rounder shapes. 
Fig.[9]shows a comparison of the r vs. i band shapes for a 
randomly selected subsample (5 per cent) of galaxies in 
the source catalogue. As shown, there is no indication of 
any systematic offset between the shape measurements 
in the two bands. 



4-3.2 RMS ellipticity and shape measurement errors 

One quantity which is important for estimating errors, 
for optimally weighting the galaxies for shear measure- 
ment, and for deriving shears from the ensemble (via the 
shear responsivity TZ^ 1 — Crms) is the RMS ellipticity. 



which is defined per component, i.e. 



2N 



(9) 



However, the complication in estimating Crms from the 
data itself is that measurement error in the shapes can 
artificially inflate the estimated emis. Thus, in practice 
it is common to estimate 



2N 



(10) 



subtracting off the estimated measurement error. This 
equation relies crucially on accurate estimation of shape 
measurement error, and was used in M05 to estimate 
that erms for SDSS galaxies is a function of magnitude, 
ranging from 0.35 for bright galaxies to 0.42 for faint 
ones. 

However, recent work with simulated SDSS data 



(Mandelbaum et al. 20121 based on realistic galaxy 
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r-band PSF FWHM [arcscc] i?2,r ei £2 etot ^phot '" 



r-band PSF FWHM [arcsec] 1 -0.329 0.000 0.000 +0.031 -0.010 -0.032 

R2,r 1 -0.015 -0.013 -0.007 +0.010 -0.361 

ei 1 -0.001 +0.043 -0.001 +0.006 

62 1 +0.023 +0.002 +0.005 

etot 1 -0.001 +0.001 

Zphot 1 +0.080 

r 1 



Table 2. Pearson correlation coefficient Vxy between pairs of observational conditions and/or galaxy properties, for a matched 
sample of galaxies observed with good and with poor seeing. In all cases, given the large sample size, the statistical uncertainty 
^Txy = 0.002. Txy entries that are nonzero at ^ 3(t significance are shown in bold. 
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Figure 6. Each panel compares three independent versions of the source catalogue covering 125 deg2 of stripe 82, with good, 
typical, and poor seeing. Top left: Histogram of r-band PSF FWHM. Top right: Same, for r-band apparent magnitude. Bottom 
left: Same, for the r-band resolution factor R2. Bottom right: Same, for 2:phot- 
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Figure 7. A comparison of observed galaxy properties for those galaxies appearing in the "good" and the "poor" seeing 
versions of the catalogues covering 125 deg'^ on stripe 82. Top left: Density contours for the comparison between the ei shape 
components in the good- and poor-seeing catalogues, with logarithmically-spaced contours representing factors of 2.5 in density. 
The plot for 62 (not shown) looks nearly identical. Top right: Same, for the r-band resolution factor, R2. Bottom left: Same, 
for the apparent model magnitude r. Bottom right: Same, for ^phot- 



models from the Cosmic Evolution Survey (COSMOS; 
Scoville||2007 \ strongly suggest that the cTe values used 
in this shape catalogue are underestimated, such that 
when we correct for the underestimation, the Crms is no 
longer a function of magnitude (or at most, is a weak 
function of magnitude). Here, we present additional ev- 
idence for this issue based on the data itself. 

The key part of this analysis is that we have shape 
measurements in two bands (r and i) that we expect 
to be the same (modulo very tiny differences due to 
colour gradients). Thus, we can consider the r and i 



band shape measurements to be measures of the same 
intrinsic quantity, with two different noise realisations. 
This means that if we estimate 



(11) 



for a — (1,2) (components), r and i representing the 
bands, and j representing the galaxy, then we expect 
a histogram of the Aeaj values to be approximately a 
Gaussian with a mean of zero and a standard deviation 
of 1. 
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Figure 8. Each panel shows a comparison between observed 
galaxy properties in the M05 and the new source catalogue 
presented here, for a representative subsample of the area. 
Top: Difference in r band apparent model magnitudes, where 
the points show values for randomly subsampled galaxies, the 
solid line shows the median trend line as a function of mag- 
nitude in the old catalogue, and the dashed lines show the 68 
per cent CL. The heavy diagonal line shows a limit imposed 
by the fact that we required r < 21.8 in both catalogues. 
The horizontal dotted line shows the ideal value of 0. Mid- 
dle: Same, for r-band resolution factor R2. Bottom: Same, 
for the band-averaged first ellipticity component ei. 




e, (r— band) 



Figure 9. A comparison of the r-band ei and the i-band 
ei on a per-galaxy basis. The contours shown are logarith- 
mically spaced by factors of 2.5 in density; the line shows 
the ideal 1:1 line. A corresponding plot for the e2 component 
(not shown) looks identical. 

As shown in Fig. [lO] for a random subsample (5 
per cent) of the source catalogue, the histogram differs 
from the expected one in two ways: first, the best-fitting 
Gaussian has a standard deviation of 1.43 rather than 
one, and second, there are tails to |Ae| ^ 5 that exceed 
our expectations for a Gaussian distribution. This plot 
represents empirical evidence that our shape errors are 
underestimated and non-Gaussian. 

We determine an approximate correction for this 
underestimation of the shape measurement errors (still 
in the Gaussian approximation) assuming that it is a 
function of apparent magnitude, resolution, and elliptic- 
ity itself. The correction takes the form of independent 
third-order polynomials in each of those three quan- 
tities, with the correction being more important for 
bright, poorly-resolved, and/or round galaxies. Once we 
apply the correction, we can estimate the Brma as a func- 
tion of apparent magnitude and other properties, for 
which the results are shown in Fig. |11| 

As shown in the top panel of Fig. which uses all 
sources in a random 5 per cent of the source catalogue 
without imposition of any additional cuts or weight fac- 
tors, the underestimated shape measurement errors led 
to what seemed like a significant evolution in the value 
of erms with magnitude, from 0.36-0.42. However, after 
correcting for the underestimation of the shape mea- 
surement errors, we see that the firms curve is much 
closer to fiat, ranging from 0.36-0.38. 

However, what really matters when computing the 
signal is the Crma for the source galaxy population be- 
hind a given lens, including all weight factors from Eq.[5] 
In the bottom panel of Fig. |11| we show what happens 
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Figure 10. Top: Histograms of difference between tlie shape 
measurements in the two bands, normalised by the expected 
shape measurement errors. As shown, the histograms are es- 
sentially identical for the two shape components, and differ 
drastically from the expected Gaussian with standard devi- 
ation of 1 (long-dashed line). The best-fit Gaussian instead 
has a standard deviation of 1.43, but the distributions clearly 
deviate from a Gaussian in that they have too much weight 
in the central peak and in the tails, and not enough at inter- 
mediate values (large kurtosis). Bottom: Cumulative prob- 
ability distributions corresponding to the histograms in the 
top panel. 



to the ernia (including the corrected shape measurement 
errors) when we impose cuts on the photo-z, requiring 
Zphot > 0.1 — a cut that corresponds to the maximum 
lens redshift for the sample used in this paper — and also 
when we include the lensing weight factors when calcu- 
lating erms- As shown, when we include the Zphot > 0.1 
cut, the estimated erms decreases somewhat, indicating 
that the source galaxy population becomes somewhat 
rounder. The inclusion of lensing weights exaggerates 
this effect even further. 

This discussion ignores the non-Gaussianity in the 
shape measurement errors, demonstrated by the kur- 
tosis leading to tails to large values in Fig. |10[ Using 
the noise probability distribution from that plot, rather 
than a Gaussian one, we estimate that the non-Gaussian 
nature of the noise has an equal impact on Crms as if 
the shape measurement errors were Gaussian but with 
a standard deviation that is a factor of 1.18 larger. 
When we include this in the calculations that go into 
Fig. |11[ the long-dashed line in the top panel becomes 
even closer to flat with magnitude than it already is. 

Ultimately, we will need to use these results about 
the Grms when we determine the shear calibration bias 
using the new catalogue in Sec. |4.4| We conclude based 
on the corrected shape measurement error estimates and 
our assessment of the noise probability distribution that 
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Figure 11. Top: RMS ellipticity estimated estimated as a 
function of magnitude in three ways: without subtracting 
the shape measurement errors (solid line), after subtracting 
the ones estimated using Eg. |A7| (dashed line), and after sub- 
tracting the ones that are corrected for their underestimation 
as in Sec. |4.3!2| (long-dashed line). Bottom: RMS ellipticity 
estimated using the corrected shape measurement errors, for 
all sources (same as top plot); sources passing a 2phot > ^lons 
cut for zions = 0.1; sources with the same photo-z cut and 
including the source weighting from Eq. ([sjl; and same as the 
previous but with zie^s = 0.3. 



an appropriate value of erms (including the source photo- 
z cut implied by the requirement that Zphot > zicns, 
and the source weighting), is in the range 0.35-0.37. 
This implies that the true shear responsivity should be 



7^= 1 



0.863-0.878. 



As discussed in Mandelbaum et al. (20121, a flat 



Brms with magnitude is also consistent with space-based 
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data from COSMOS ( [Leauthaud et al.|[2007| ). While 
the actual values of e^ms differ (0.27 there, versus 0.36 
here), this is a consequence of the different shape es- 
timators used for the COSMOS versus for the SDSS 
data, rather than a true disagreement in the intrinsic 



galaxy shapes ( Mandelbaum et al. 2012 1. The circularly- 



weighted shape estimators used for the COSMOS galax- 
ies will tend to less elliptical (rounder) measurements 
than the adaptive moments used for SDSS. 



4.4 Lensing signal calibration 

To estimate the calibration of the lensin g sig nal mea- 



sured as weighted sums over 7tSc (Sec. 2.3 1, we con- 
sider several types of calibration biases, from M05 and 
subsequent work (e.g. Mandelbaum et al. 2008bJ . We 



then combine the estimated biases by multiplying them 
all together, assuming that they are independent. We 
combine the la calibration uncertainties in the follow- 
ing way: those that are roughly independent are added 
in quadrature, whereas those that cannot be considered 
independently from each other (since they arise due to 
e.g. related issues in the data reduction or galaxy selec- 
tion) are added linearly. We therefore consider calibra- 
tion biases and uncertainties due to all of the following: 

(i) Calibration uncertainty due to misestimation of 
Ec due to use of photo-z for sources. 

(ii) Stellar contamination due to incorrect inclusion 
of stars in the shape catalogue. 

(iii) PSF model uncertainty. 

(iv) Shear responsivity error due to mistaken erms es- 
timate resulting from incorrect shape measurement er- 
ror estimates (as in Sec. |4.3.2| ). 

(v) Three different shear calibration biases that we 
cannot consider independently, and that have uncertain- 
ties that add linearly: 

(a) PSF dilution 

(b) Noise rectification bias 

(c) Selection biases 

We describe each of these separately and conclude 
this subsection with a final tally of the systematic error 
budget. 



4-4- i Photometric redshift errors 

Nil addressed the question of how photo-2 errors for 
these ZEBRA photo-z impact the calibration of the 
galaxy-galaxy lensing signal, as a function of lens red- 
shift. Here, we simply use the results from Nil directly, 
which results in a calibration bias estimate of 2.0 ± 0.5 
{la) per cent (the same for each stellar mass bin due to 
their similar redshift distributions). 



lamination in the galaxy-galaxy lensing signal. Out of 
4 290 source galaxies overlapping the COSMOS region, 
32 are stars, or 0.75 per cent. However, to constrain 
their impact on the shear calibration, we cannot simply 
use the fractional contamination; we have to include the 
lensing weights to calculate the fractional weight given 
to stars, and account for the fact that we require source 
•Zphot > 2iens- When we take this into account, we es- 
timate stellar contamination of 0.64 per cent, with la 
Poisson uncertainties of [—0.15,-1-0.12] per cent. 

However, there is an additional uncertainty hav- 
ing to do with the fact that (a) the stellar density is 
not always the same as in the COSMOS field, and (b) 
the observing conditions in the COSMOS field, includ- 
ing the atypically high sky noise (see Nil for details), 
may affect the infiuence of incorrectly including stars 
in the catalogue. The first issue is likely not very sig- 
nificant, since the stellar number density depends on 
galactic latitude according to 1/| sinb|. On average, this 
quantity is equal to 1.43 over the COSMOS field versus 
1.40 over the whole catalogue (the difference is not se- 
vere because we exclude regions near | sin 6| ~ with the 
Ar < 0.2 cut). Because of systematic uncertainty associ- 
ated with the second point, we double the la statistical 
error, which ends up giving us a 50 per cent uncertainty 
in the stellar contamination: 0.64 ± 0.3 per cent. Note 
that stellar contamination biases the shear to be lower, 
so the sign of the bias is negative, i.e., —0.64 per cent. 



4-4-3 PSF model uncertainty 

We follow the method of [Hirata et al.| ( |2004[ ) for han- 
dling PSF model uncertainty. The basic idea is if the 
PSF model is not correct, then the PSF correction can 
introduce systematics into the shape measurement. If 
these errors in the PSF model are purely statistical, 
then we can expect that they will lead to shear cali- 
bration biases that will ultimately cancel out using a 
large enough area. However, systematic issues with the 
PSF model size will lead to coherent tendencies to over- 
or under-correct for the dilution of the galaxy shapes 
due to the PSF, so we would like to constrain such sys- 
tematics 13 

We first estimate systematic errors in PSF elliptic- 
ity {6ei, Se2) and trace, 5T^^^ /T^^\ using real stars 
drawn from randomly selected fields in the source cat- 
alogue. In each randomly selected field, we use all stars 
passing basic flag cuts and with 18 < r < 20. The moti- 
vation behind using this range of magnitudes is that (a) 
they are faint enough that this is a non-trivial test of 
the PSF model because PSF star^only go as faint as 
r ~19, and (b) they are bright enough that the moments 
are not too noisy and contamination of the star sample 



4-4-2 Stellar contamination 

Using the calibration sample from Nil and space-based 
data from COSMOS, we can constrain the stellar con- 



Systematic issues with the PSF ellipticity typically cause 
coherent additive shears rather than shear calibration biases; 
see Sec. [4X2| 
^ R. Lupton, priv. comm. 
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by galaxies is not overly large ( Lupton et al.|2doT l. For 
each of the 5 x 10^ stars, we find, in both r and i bands, 
the following; 



Aei = ei,star — ei,pSF 

Ae2 = e2,star — e2,PSF 

AlnT = (Tstar — TpSF)/rpSF, 



(12) 



where T is the trace of the moment matrix (Eq. Al I 
and the "PSF" quantities are those of the fuU KL PSF 
model extrapolated to the position of that star. 

The results of this test are shown in Fig. [12] The 
upper left panel shows the histograms of Aei, Ae2, and 
AlnT in the r band (i band is qualitatively similar). 
Before generating this histogram, we removed the small 
fraction of outliers that had > 5cr discrepancies between 
the star and PSF model, assuming that these are due to 
contamination of the star sample rather than true mod- 
eling failures. As shown, the PSF model ellipticities ap- 
pear to be quite accurately estimated. However, the PSF 
trace is skewed towards positive values. Note that this is 
exactly the signature we might expect from contamina- 
tion of our test sample by poorly resolved galaxies (since 
we have not imposed precisely the same set of cuts that 
are used to select stars for PSF reconstruction). As a 
result, we cannot definitively state that these charac- 
teristics of the histogram are due to true PSF model 
failure. Moreover, even if it is due to PSF model failure, 
the actual size of the failure is (in the mean) quite small, 
< 1 per cent. But a difficulty with this explanation is 
that when we split the star sample used for these tests 
into r < 19, which may have been used for PSF model 
estimation, and r > 19, which should not have been, we 
find that the deviations between the star sizes and the 
PSF sizes are actually more significant for the brighter 
sample than for the fainter one. 

The other panels of Fig. |12| show trends in the quan- 
tities from Eq. [12] with camcol, where the results in 
camcol= show the results for all camcols averaged 
together. As shown in the bottom panels, while the me- 
dian and mean deviations in the PSF model elliptici- 
ties are of order 5 x 10~^ (with the median and means 
agreeing quite well, given that the distributions in the 
upper left panel are well behaved), there is an inter- 
esting trend with camcol for the r-band, with the r- 
band PSF model ellipticity in camcol 2 being on average 
wrong by ~ 3 X 10""^ (about 20 times worse than for the 
data in all camcols, overall). This trend was originally 
noted in Huff et al. (2011, m prep.), and it means that 
if we average the galaxy shapes in camcol 2, they have 
a small systematic deviation from zero as well, because 
the wrong PSF ellipticity was used in the PSF correction 
process. Possible explanations include improper nonlin- 
earity corrections on the r-band camcol 2 CCD, because 
the stars used to create the PSF model have significant 
nonlinearity corrections, but the stars used for this test 
and the galaxies typically used for the shape catalogue 
do not. Fortunately, while this does impact large-scale 
systematic additive shears, it does not affect the shear 



calibration since the PSF model size is not nearly as 
affected as its ellipticity. 

The upper right panel shows statistics of A In T in 
both r and i band. The median and mean values dif- 
fer because the distribution is, as shown in the top left 
panel, noticeably skewed. There is no drastic trend with 
camcol (as shown in the upper right panel). For the 
value of A In T that goes into our estimate of shear cal- 
ibration bias due to improper PSF modeling, we will 
use a lower limit of zero, because the trends in this plot 
could in principle be caused by contamination of the 
"star" sample by some number of very poorly resolved 
galaxies. Our upper limit will be the upper line on the 
upper right panel, which gives 0.01 (1 per cent error in 
the PSF trace). Note that our lower and upper limits 
are not driven by the spread in the histogram in the up- 
per left panel, because we assume that shear calibration 
biases from deviations in the PSF model due to noise 
will simply average out. So, we only worry about how 
well we can constrain the typical value of A In T. 

Then we use 



(57 
7 



= -iR2 



1) 



(13) 



from Hirata et al. ( 2004 1 (accounting for the difference 



in the sign convention in the definition of A In T from 
Eq. 121. This equation says that the shear calibration 



bias due to incorrect PSF size estimation is more impor- 
tant for poorly resolved galaxies than for well-resolved 
galaxies, as one would expect. The sign of the effect is 
such that if the PSF size estimate is biased low, the 
shape measurements would also be biased low. our cor- 
rections are not large enough. This means that the shape 
measurements are too low. When we include the fact 
that the average value of {-RJ^ — 1) for the catalogue 
is 0.70 (using Zphot > -Ziens for the lens redshifts used 
in this work), and calculating a weighted mean using 
the lensing weights from Eq. [5] we estimate an uncer- 
tainty for the shear calibration bias due to systematic 
PSF reconstruction errors to be in the range [0.0, 0.007]. 
In practice, we assume a mean bias in the shear due to 
PSF model errors of —0.4 ± 0.4 per cent. 



4-4-4 Shear responsivity error 



As shown in Sec. |2.3[ our shear estimator relies on the 
fact that the shape definition in Eq. ([sjl responds to 
shear in a particular way: 5e+/5j+ Ri 2(1 — e^^^) = 27?., 
where the shear responsivity depends on the intrinsic 
RMS ellipticity per component for the source sample. 
As described in Sec. |4.3.2| accurate estimation of TZ re- 
quires us to be able to subtract the shape measurement 
error to get the 'true' RMS ellipticity of the population. 
We have already shown there that our shape measure- 
ment errors are underestimated, leading to an overes- 
timation of erms, underestimation of TZ, and overesti- 
mation of the shear. As we concluded there, the true 
value of ernis should be in the range 0.35-0.37, giving 
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Figure 12. Plots relating to PSF modeling failures. Top left: Histograms of r-band Aei, Ae2, and AlnT for a sampl e of 
~ 5 X 10^ stars in randomly selected fields in the source catalogue, where the quantities that are plotted are defined in Eq. | |12| | 
and associated text. Top right: The typical value of AlnT as a function of camcol, where the value at camcol= is averaged 
over all 6 camcols. The size of the statistical errors are similar to the size of the points themselves, so they have been omitted. 
The four lines, as labeled in the lower left panel, show results for the two bands (r and i) and show the median and mean trend, 
which differ because the distribution (shown in the top left panel) is skewed. Bottom left: Same as top right, for Aei. Bottom 
right: Same as previous, for Ae2. 



using the shape measurement errors in the catalogue, 
we had estimated TZ= 0.848. This means that our mean 
calibration bias for which we must correct is +2.5 per 
cent, and its la uncertainty is, conservatively, ±1 per 
cent. 



4.4.5 PSF dilution 

PSF dilution is the rounding of the galaxy shape due 
to the PSF. A key purpose of PSF-correction is to ap- 
ply a correction for the PSF dilution. The accuracy of 
the re-Gaussianization dilution correction, as a func- 
tion of the galaxy resolution factor and Scrsic index, 
was shown by Hirata & Seljak ( |2003J and ^ Mandclbauni| 



et al. (2005a I using noiseless idealised simulations; more 



recently, Mandelbaum et al. ( 2012 1 did the same test but 
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using a realistic distribution of galaxy morphologies (in- 
cluding small-scale structure, starting from COSMOS 
galaxy images) and SDSS PSFs. 

The simulations in Mandelbaum et al. (2012) 



specifically mimicked the SDSS imaging conditions in 
the COSMOS field, which have slightly better seeing 
and somewhat higher sky noise than typical SDSS data. 
From the results shown for the noisy simulations there, 
we can directly derive the calibration bias due to PSF di- 
lution and noise rectification bias together, rather than 
deriving separate estimates for each one. 



4.4-6 Noise rectification 

Noise rectification bias is bias in shear estimates due to 
the finite S/N of the galaxy images. Because the estima- 
tion of the galaxy shape is a nonlinear process involving 
measurement of galaxy moments, the noise in the origi- 
nal image means we cannot necessarily estimate galaxy 
shapes that are unbiased in the presence of noise. |Hi-| 



rata et al. ( 2004 1 and M05 gave analytic approximations 



for the noise rectification bias for methods such as re- 
Gaussianization that rely on using adaptive moments of 
the image to estimate the shape. As shown there, the 
magnitude and sign of the effect depends on how well 
resolved the galaxy is, but it becomes large and positive 
for poorly resolved galaxies, which is the motivation for 
our imposition of a cut on resolution factor at J?2 = 1/3. 
In this work, we rely on the simulated SDSS data de- 



scribed above, from Mandelbaum et al. ( 2012 1. The idea 



is that simulating data without any noise allows us to 
derive a calibration bias due to incorrect PSF dilution 
corrections, and when we add noise to the simulations, 
we are measuring a combination of the calibration bi- 
ases from both the incorrect dilution corrections and the 
noise rectification bias. 

The resulting bias and uncertainty on that bias are 
—6 ± 2 per cent (the —4 ± 2 per cent from Mandelbaum 



|et al.|[2012| included, as discussed there, a +2 per cent 
bias due to shear responsivity errors, which we want 
to treat separately for the purpose of this paper). Ifow- 
ever, there are additional effects that must be taken into 
account in any realistic analysis, which we account for 
here. 

First, we must consider the fact that in [Mandel-^ 



baum et al. (20121 we applied only a crude galaxy se- 



lection to the simulated galaxies: rather than process- 
ing the simulations with Photo so that we could cut 
on the model magnitudes, we simply applied an esti- 
mated S/N cut (using the estimated a-y based on the 
sky noise, ellipticity, and resolution factor). The other 
cuts were similar to those employed here, on the reso- 
lution factor and total ellipticity. The first question we 
face is whether the galaxy population that results is sim- 
ilar in S/N and resolution factor to the observed one in 
the real catalogue. To avoid issues of sample variance, 
we ask this question specifically using the real source 
catalogue in the COSMOS region. There, we find that 
there are 3 695 galaxies with COSMOS galaxy postage 



stamps used for the simulation that pass all cuts in the 
real data, as compared with 3 680 in the simulations. 
These numbers are statistically consistent once we ac- 
count for the fact that different random noise fields were 
added to the simulations than to the real data. This 
finding suggests that the cuts in the simulations cannot 
be effectively too different from the model magnitude 
cuts in the real data. Moreover, the two-dimensional 
distributions of resolution factor and S/N were nearly 
consistent, with the effective mean values the same in 
the simulations as in the real data to within 3 per cent. 
Given that the shear calibration was only found to be a 
weak function of those properties in [Mandelbaum et al.| 
p012l ), we assume that the effect of the crude cuts im- 
posed on the simulations leads to a negligible change 
in the errors due to PSF dilution corrections and noise 
rectification bias with respect to those in the real data. 

The next issue is the fact that the shear calibra- 
tion bias depends on the galaxy population, which (as 
shown previously in Sec. 4.2 1 depends on the observing 



conditions. Therefore, we must simulate SDSS data with 
other observing conditions besides that in the COSMOS 
field. To carry out this test, we chose 8 random locations 
within the footprint of the source catalogue, and sim- 
ulated the COSMOS field as it would have looked at 
those positions, including the full PSF model and sky 
noise. We then checked how much the shear calibration 
depends on the conditions. While detailed results will 
be shown in Mandelbaum et al. (2011, in prep.), we find 
that the shear calibration bias (in contrast to the bias 
due to photo-z, see Nil) is not demonstrably a function 
of observing conditions when we allow them to vary in 
a reasonable way as sampled by these random points, 
including variations in PSF ellipticity, PSF size, PSF 
kurtosis, and sky noise. For the eight random positions, 
we find a shear calibration bias due to PSF dilution 
of —4.8 ± 2.5 per cent, which is statistically consistent 
with the — 6±2 per cent from ( [Mandelbaum et al.|2012] ) 
within our claimed 2 per cent (la) uncertainty. 

In addition, we must include in the simulation 
such realistic effects as the selection of galaxies with 
Zphot > zicns and the galaxy weighting by 1/E^ (Eq.jHJ, 
rather than taking unweighted averages of all galajc- 
ies passing basic shape cuts as in ,Mandelbaum et al.| 
(20121. We have carried out several tests of this issue; 



the basic tests have included first cuts on photo-z and 
then the realistic weighting, with two possible photo-z. 
The first are the COSMOS photo-z ( [Ilbert et al.|2009| l, 
which have an RMS uncertainty < 0.01 for the apparent 
magnitudes considered here. The second are simulated 
SDSS photo-z that start with the COSMOS photo-z and 
put in the error model estimated by Nil for the SDSS 
ZEBRA photo-z[^ These tests reveal that the average 



We cannot include photo-z as part of our simulation of 
SDSS data directly, because it would require us to (a) sim- 
ulate more than just i band and (b) measure galaxy colours 
using the same method as for real data, including process- 
ing the simulations with the SDSS Photo pipeline. However, 
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calibration bias due to PSF dilution and noise rectifi- 
cation bias changes by —0.5 per cent (i.e., significantly 
less than our quoted uncertainty of 2 per cent from the 
simulations in Mandelbaum et al. 20121. We therefore 



<57 _ ^2,m in(l 

7 



i?2 



7^ 



(14) 



correct for a systematic bias due to these two effects of 
—5.3 per cent — the —4.8 per cent averaged over random 
positions within SDSS with a —0.5 per cent correction 
due to weighting effects. Our estimated la uncertainty 
is ±2 per cent. 



4-4-'^ Selection bias 

The final shear calibration bias that we consider is se- 
lection bias. As described in Hirata et al. (20041, the 



dominant selection bias for our catalogue is due to the 
responsivity cut of _R2 > 1 /3 in both bands. For a galaxy 
of a given area, those that are more elongated will have a 
larger R2 than those that are roundQ This means that 
a shear will effectively increase the R2 value of any given 
galaxy. For a galaxy near the resolution limit, this means 
that if its intrinsic (pre-lensing) shape is aligned with 
the weak lensing shear, then its ellipticity and there- 
fore R2 will be increased and it may be included in our 
sample, whereas those that are anti-aligned with the 
shear will be made more round and therefore have a 
reduced R2, so they might fall out of our sample. Ef- 
fectively, this means that the assumptions behind any 
weak lensing analysis, that the mean tangential galaxy 
ellipticities should go to zero in the absence of lensing, 
will be violated. The overall effect of this selection bias 
is to enhance the estimated shear, unless there is also 
an upper limit on resolution which will give an effect of 
opposite sign (the relative magnitude of the effects due 
to lower and upper limits on resolution depend on the 
resolution factor distribution). 

Unfortunately, the simulations based on COSMOS 



data from Mandelbaum et al. (20121 that are used to 



estimate the magnitude of shear biases due to incorrect 
PSF dilution corrections and noise rectification biases 
do not include this particular calibration bias. The rea- 
son for this is that those simulations were generated 
using pairs of galaxies that were identical but for a 90 
degree rotation. When the galaxy pairs are sheared, one 
has its ellipticity and R2 increased, and the opposite oc- 
curs to the other, so requiring R2 > 1/3 for both will 
not cause the same selection bias as in the real data, 
where we require R2 > 1/3 for two noise realisations 
with the same orientation (i.e., the r and i band data). 

To estimate this selection effect, we rely on the ar- 
guments in M05 that, in the Gaussian approximation, 
we can use 



as demonstrated in Sec. |4.2| the photo-2 errors and elliptic- 
ity errors are not correlated between different observations, 
so it is fair to treat them separately rather than within one 
self-consistent framework. 

One could imagine redefining the galaxy resolution in a 
way that does not lead to such a selection bias. 



Here, n(_R2,min) comes from the histogram of R2 values 
derived from using, for each galaxy, the minimum value 
of R2 in r or i band, and evaluating the histogram at 
the lower bound of 1/3. The reason for this is that our 
selection imposes R2 > 1/3 in both bands, and therefore 
the lower of the two determines whether a galaxy makes 
it into the sample. We have plotted this histogram in 
Fig. [2] however, for the sake of this calculation, we must 
use a weighted histogram that includes (a) the photo-2: 
cuts for Zphot > zians and (b) the weighting that goes 
into calculation of the lensing signal. When we do this, 
we estimate 57/7 ~ 5 per cent. Given the approxima- 
tions going into this calculation, we assign a la uncer- 
tainty to this calibration bias equal to half its value, or 
±2.5 per cent. 

We have attempted a more precise empirical es- 
timate using the simulations from [Mandelbaum et al.| 
(20121. To do so, we impose our selection criteria on 



galaxy pairs from two independent noise maps of the 
galaxy with the same orientation (to mimic our real 
SDSS selection in r and i). In this case, we expect the 
selection bias to operate in a similar way as in the real 
data. We can do this again using galaxy pairs from two 
independent noise maps of the 90 degree rotated orien- 
tation, which will have the same selection bias, and then 
average those results with the first set of results to beat 
down the noise. In this case, however, the shape noise 
cancellation is no longer exact (because there are many 
boundary cases where the original or rotated orienta- 
tion does not get included, either due to noise or the 
selection bias) and therefore the results are more noisy. 
However, the preliminary estimate of this effect is half 
as large as the one from the analytic estimate, or 2.5 
per cent, and is likely more reliable given that it has far 
fewer assumptions about the galaxy models and the way 
the resolution factors respond to shear for those models. 
For the purpose of this work, we therefore assume a cal- 
ibration bias due to selection bias of 2.5 ± 2.5 per cent, 
and defer a more precise estimate from simulations to 
future work, Mandelbaum et al. (2011, in prep). 

Note that there is an additional possible selection 
bias, due to our requirement that etot < 2. However, any 
bias in the shear due to this cut will have been implicitly 
included in the estimates of shear calibration bias from 
the COSMOS-based simulations used to estimate PSF 
dilution corrections and noise rectification bias, so we 
do not consider it separately. 

4.4-8 The total systematic error budget 

We find that the calibration bias due to calculating the 
lensing signal estimated using the procedure in Sec. |2.3| 
after combining all effects from the rest of this section, 
is +0.5 per cent. Therefore, for figures in this paper, we 
multiply the signal by a factor of 0.995 (accumulating 
all factors from this subsection). In practice, when per- 
forming fits to the signal, we apply the inverse of this 
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calibration factor to the theoretical signals before com- 
paring with the data, rather than doing anything to the 
data. 

Then, as described at the start of Sec. |4.4[ we first 
add the uncertainties for the last three types of bias 
linearly, because they are all related. This gives a la 
calibration uncertainty due to these three effects of 4.5 
per cent. These are then added in quadrature with the 
first four effects, to give a total calibration uncertainty 
of 5 per cent. We defer work to lower this systematic 
error budget, which is dominated by shear calibration 
effects, to future work. 

As a basic test of our understanding of shear sys- 
tematics, we also estimate the relative calibration bias 
when computing lensing signals using the source sam- 
ple split into r < 21 and r > 21 samples containing 60 
and 40 per cent of the galaxies, respectively. Given the 
nearly identical range of Zaic covered by these samples, 
we expect theoretically that the lensing signals should 
be the same, and as in M05, we exploit this to test for 
shear calibration biases that might differ for the differ- 
ent galaxy populations. Carrying through the same cal- 
culations as in the previous subsection, but for r < 21 
and > 21 sources separately, we estimate calibration 
biases of —0.3 and —3.3 per cent respectively. We will 
test in Sec. |5.2.3| whether the observed signal ratio for 
the two samples is consistent with our understanding 
of the shear calibration after correcting for those fac- 
tors (which provides a basic test of our combined un- 
derstanding of all the above systematic errors). 

4.5 Scale-dependent shear systematics 

In M05 and several subsequent works (most notably 
Mandelbaum et al.||2006a|b l issues were raised regard- 
ing systematic errors in the shape catalogue that would 
cause scale-dependent systematic errors in the lensing 
signal. We discuss several such observational effects in 
turn. Note that scale-dependent issues relating to the- 
oretical uncertainties can be resolved at the stage of 
modeling, so they are not discussed here. These include 
the impact of lensing magnification (previously studied 
Mandelbaum et al.||2006b negligible here), dust ex- 



tinction (for dust associated with the lens galaxies on 
large scales, as in Menard et al.|20"l0 l, and the possible 
need to model the contribution to the mass profile from 
the stellar component of the lens galaxies are discussed 
later, in Sec. |6.1| 



4-5.1 Sky subtraction 

A point that was raised in M05, and further quanti- 
fied in Mandelbaum et al. (2006b I, is that the version 



of the SDSS Photo pipeline used up through DR7 has 
difficulty properly estimating the sky around large and 
bright galaxies, such as those that are typically used as 
lens galaxies for galaxy-galaxy lensing. As described in 



which alleviates some of the original problem, but not 
all. Since galaxy-galaxy lensing measurements require 
the robust detection and measurement of apparently 
faint galaxies (sources) nearby bright ones (lenses), the 
residual sky subtraction problems can affect the lensing 
signal computation up to ~ 100" which corresponds to 
41 kpc a.t z — 0.02 or 185 kpc at z = 0.1 (these redshifts 
bracket the typical redshifts of the lens sample used for 
this work). 

The sky subtraction error can have two potential 
impacts on galaxy-galaxy lensing measurements. First, 
we may fail to detect some faint galaxies due to the sky 
being overestimated. Since we normalise by the num- 
ber of galaxies around random points (in order to re- 
move the effects of physically-associated sources), we 
would end up multiplying our signal by a normalising 
factor that is too low, and therefore underestimating 
the galaxy-galaxy lensing signal. Second, unsubtracted 
sky gradients could in principle impart some tangential 
shear to the source galaxies. Thus, we must check for 
both errors in the number counts of faint galaxies near 
bright ones, and systematic errors in the shears. 

For this test, we rely on the apparent magnitude 
histogram of the parent disk lens sample, in Rll, which 
is dominated by galaxies fainter than r ~ 17, along with 



figure 4 in Aihara et al. (20111. The lower right panel 



of that figure shows that for 17 < r < 17.5 lenses, the 
number density of detected sources will be suppressed 
by ~ 2.5 per cent for 20 < 9 < 80". This suggests that 
our boost factors and therefore measured lensing signals 
must be underestimated by about this amount on those 
scales (< 33 kpc for lenses a.t z — 0.02, and < 150 kpc 
for lenses at 2; = 0.1). 

In addition, we rely on some additional tests — not 
shown in Aihara et al. (2011 1 — using stars as "lenses" 



and looking for a tangential shear signal using the source 
catalogue. The idea is that there should not be any sig- 
nal, so we can assume that observed signals are due to 
incorrect sky estimation leading to gradients that turn 
into tangential or radial shears. While we find strong 
evidence for such shear signals using the M05 catalogue 
to = 25", they are reduced with the new catalogue 
such that they are only evident for 6 < 10" (at the 4a 
level) . Since we do not use such scales for science, we do 
not have to worry about this. 



4-5.2 Systematic shear 



As was clearly demonstrated in [Mandelbaum et al.| 
(2006a I, there is a slight "systematic shear" in the 
catalogue — a coherent smearing of the PSFs and the 
PSF-corrected galaxy shapes along the scan direction, 
due to the fact that the re-Gaussianization PSF cor- 
rection method as implemented for this work allows 
~ 5 X 10~^ of the PSF ellipticity to leak into the es- 



timated shears (Mandelbaum et al. 20121. Given the 



Aihara et al. (2011 1, the new version of Photo used for 



DR8 (v5_6) has an improved sky subtraction algorithm 



factor of shear responsivity and the typical PSF ellip- 
ticity of ~ 0.05 in SDSS, this corresponds to an additive 
systematic ellipticity of |esys| ~ 3 x 10~* per galaxy. 



© 0000 RAS, MNRAS 000, 000-000 



22 Reyes et al. 



As a rule, this systematic shear is not a problem 
for galaxy-galaxy lensing, since the azimuthal averag- 
ing of the tangential galaxy shears at a given transverse 
separation R removes it from the measured lensing sig- 
nal. However, this is no longer true once we are at large 
enough R that survey edge effects are important. To 
lowest order, we can then use the lensing signal around 
random points to remove this systematic shear from the 
real galaxy-galaxy lensing signal, provided that any cor- 
relations between the lens density and the systematic 
errors that determine the size of the systematic shear 
are also obeyed by the random catalogue. In Sec. |5.2.T| 
we will show the galaxy-galaxy lensing signal around 
random points obeying the same redshift distribution 
and area coverage as our real lenses, and demonstrate 
that for this work, we are still on sufficiently small scales 
that the systematic shear is irrelevant. We defer a dis- 
cussion of larger scales, where it may be important, to 
future work (Mandelbaum et al. 2011, in prep.). 



10= 




R (kpc) 



5 GALAXY-GALAXY LENSING SIGNAL 

In this section, we present the galaxy-galaxy lensing sig- 
nal of the lens sample (Sec. |5.l] ). Then, we describe and 
present the results of several tests of systematics on the 
lensing signal: namely, the random points test, the 45- 
degree test, and the ratio test (Sec. |5.2]). 



5.1 Lens sample 



Figure 13 shows the measured lensing signals AE(7?) for 
our lens sample in three stellar mass bins with lensing- 
weighted mean stellar masses of 0.62, 2.68, and 6.52 
xIO^^Mq (inverted blue triangles, green circles, and 
red triangles, respectively). The amplitude of the mea- 
sured lensing signal increases with stellar mass, indicat- 
ing increasing halo mass, as expected. We perform fits 
to model density profiles (shown by different curves in 
Fig. |13[ ) to determine halo masses in Sec. [6] 

Table[3]lists the basic properties of each stellar mass 
bin, including the range in M» , the number of galaxies in 
the bin A'gai, the unweighted and lensing- weighted mean 
stellar mass (cols. 2-5). The lensing- weighted mean ro- 
tation velocity (col. 6) is calculated using the TFR, 
Eq.[7l 

shows the boost factors B{R) applied to 



Figure 



14 



these lensing signals (symbols refer to the same bins). 
We find that B{R) is consistent with 1 for _R > 200 kpc 
in all three cases, as expected. 



5.2 Tests of systematics 

In this section, we present tests of systematics on the 
lensing signals calculated using the procedure in Sec. |2.3| 
and corrected for the calibration bias factors estimated 
in Sec. ITl 



Figure 13. Measured lensing signals AS(_R) around stacked 
disk galaxies in three bins with weighted mean stellar 
masses of 0.62, 2.68, and 6.52 X 10^*^ Mq, shown as blue in- 
verted triangles, green circles, and red triangles, respectively. 
Also shown are the best-fitting one-halo and halo-halo pro- 
files (black dotted and red dashed curves, respectively), the 
estimated baryonic component (blue dot-dashed curves) and 
the sum of these three (black solid curves). The range of 
scales used for the fits is R = 50-2000 kpc. For this range of 
scales, the baryonic contribution is negligible and the halo- 
halo contribution is sub-dominant to the one-halo term. We 
model the lensing signal as a sum of the one-halo and halo- 
halo profiles (as described in Sec.|6]l. 



5.2.1 Random points test 

Figure [Ts] shows the lensing signal ASrand(^) around 
stacked random galaxies for three M« bins with 
weighted mean stellar masses of 0.62, 2.68, and 6.52 
X lO^^M© (blue inverted triangles, green circles, and red 
triangles, respectively). 

As shown, these signals are approximately consis- 
tent with zero, which indicates that additive shear sys- 
tematics do not significantly affect the measured signals 
on these scales. There is a slight tendency for the sig- 
nals to be negative, which is more pronounced (and sta- 
tistically significant) on far larger scales than are used 
in this work (Mandelbaum et al. 2011, in prep). This 
is most noticeable for the last data point in the low- 
est stellar mass bin. Using the full (non-rebinned) data, 
the for a fit to zero signal is 49.5, 23.5, 23.4 for the 
lowest to highest stellar mass bins. When computing the 
p{> x^), ke. the probability of getting a x^ value at least 
as large as the observed one due to random chance, we 



use a simulation described in Hirata et al. ( 2004 1 to ac- 



count for the noise in the bootstrap covariance matrices, 
which tends to artificially increase the values so that 
they deviate from the expected x^ distribution. Using 
that simulation, we find p{> x^) = 1, 59, and 59 per 
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9.00- 


-10.22 


78419 


9.808 


9.792 


2.06 


0.0555 


-19.87 


0.4608 


10.22- 


-10.70 


47419 


10.421 


10.428 


2.23 


0.0744 


-21.06 


0.6124 


10.70- 


-11.00 


7760 


10.807 


10.814 


2.34 


0.0762 


-21.86 


0.6821 



Table 3. Basic properties of the stellar mass bins used in the lensing analysis. The subscript "L" indicates lensing-weighted 
means, with weighting factors wi^. 



1.04- 
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Figure 14. Boost factors B{R) applied to the lensing signals 
shown in Fig. |13| for three bins with weighted mean 
stellar masses of 0.62, 2.68, and 6.52xlOl°M0, (blue inverted 
triangles, green circles, and red triangles, respectively). As 
expected, the boost factors are consistent with unity for R > 
200 kpc. 
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Figure 15. Measured lensing signals ASi.and(J2) around 
stacked random galaxies for three bins with weighted 
mean stellar masses of 0.62, 2.68, and 6.52 X 10^° Mq (blue 
inverted triangles, green circles, and red triangles, respec- 
tively) . 



cent, respectively. The low p- value for the lowest stellar 
mass bin is driven by the last data point. However, this 
slight sign of systematic shear does not affect our ability 
to do science: because galaxy-galaxy lensing is a cross- 
correlation, we can simply subtract the signal around 
random points from the signal around real lenses 



5.2.2 45- degree test 

While gravitational lensing causes a coherent tangential 
shear effect around the lens galaxies, it does not cause 
any average shape distortion in the other (45 degree) 
ellipticity component. However, there are systematics 
that could cause such a nonzero AE45, so we measure 
it (using the analogous equation to Eq. |6] but with the 
other shear component) as a systematics test. The result 
is shown in Fig. |16[ as shown, it is consistent with zero 
for all 3 samples, with a for a fit to zero of 16.4, 15.5, 



More detailed investigations of the efficacy of the random 
points subtraction procedure will be presented in Mandel- 
baum et al. (2011, in prep). 



36.3 (23 degrees of freedom), giving a p(> x^) ~ 90, 
93, and 12 per cent, for the lowest to highest stellar 
mass bins. Thus, there is no clear evidence of systematic 
errors from the 45-degree test. 



5.2.3 Ratio test 



As described in Sec. |4.4.8l we have separately calculated 
the lensing signals for the sources split into r < 21 and 
r > 21 separately, to ensure that we properly under- 
stand the shear calibration bias as it depends on galaxy 
properties. 

We fit the lensing signals for the r < 21 and r > 21 
source samples to the best-fit halo profile for our fidu- 
cial fit to the lensing signal for all sources (described in 
Sec. 6.2 1, i.e., we fix the shape and only allow the nor- 



malization to vary. We find best-fitting scale factors of 
0.99 ± 0.13 and 1.04 ± 0.16 for the r < 21 and r > 21 
source samples, respectively, after taking the average 
over the three stellar mass bins (since the bins largely 
sample the same source populations, given that they 
are all at low redshift, below most of the sources). Note 
that the 13 and 16 per cent uncertainties do not rep- 
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Figure 16. Measured lensing signals AT,45{R) for three Af, 
bins with weighted mean stellar masses of 0.62, 2.68, and 
6.52 X IO^^Mq (blue inverted triangles, green circles, and 
red triangles, respectively). 



resent our systematics floor; the systematics floor is set 
by the arguments in Section 4, and we carry out this 
test to confirm that there are no obvious observational 
reasons not to trust the arguments given there. We find 
no evidence for an offset between the lensing signals for 
the different source samples, and the calculated ratio, 
0.95±0.19, is consistent with the predicted ratio of 1.03. 



6 FITS TO THE LENSING SIGNAL 

We perform fits to the measured lensing signals to de- 
termine halo masses and halo-to-stellar mass ratios as 
a function of stellar mass. We describe these fits in this 
section, and present their results in the following sec- 
tion, Sec. [T] We begin by describing the different com- 
ponents of the model lensing profile (Sec. 6.1 1. Then, 



we describe the fiducial fits from which we obtain our 



main results (Sec. 6.2 1. Finally, we describe various al- 



ternative fits that we use to test the robustness of these 



results (Sec. 6.31. Overall, we find that our results are 
not very sensitive to the particular choices made in these 
fits. 



6.1 Density profiles 

There are several contributions to the observed lensing 
signal AE(i?). The dominant contribution comes from 
the density profile of the dark matter halo in which the 
galaxy lives (also called the one-halo term). On large 
scales, beyond 1 Mpc, there is a contribution from statis- 
tical correlations between dark matter halos from large 
scale structure (also called the halo-halo term). On small 



scales, below several hundred kpc, there is a contribu- 
tion from the baryons (stars and gas) in the galaxy. 

Figure [13] shows these three different contributions 
to the lensing signal as (black) dotted, (red) dashed, and 
(blue) dot-dashed curves, respectively. The one-halo and 
halo-halo terms shown are the best-fitting halo profiles 



from fits to the data (described in Sec. 6.21. The bary- 
onic contributions shown are for galaxies with stellar 
mass equal to the lensing-weighted average M* for each 
stellar mass bin, and gas masses estimated from the av- 
erage relation between gas-to-stellar mass ratios and M, 
(based on data from Rll, c.f. their fig. 11) 

We find that over the range of scales used for 
the lensing analysis, 50-2 000 kpc, the contribution to 
the lensing signal from the baryonic component of the 
galaxy is negligible]^ On the other hand, the contri- 
bution from the halo-halo term is negligible below ~ 1 
Mpc, and its contribution to the total lensing signal for 
the outermost radial bin is comparable to the la uncer- 
tainty in the measurement. 

Recall that we have excluded satellite galaxies from 



the lens sample (as described in Sec. 3.2 1 to simplify the 



analysis and interpretation of the lensing signal. For a 
satellite galaxy, which resides in a dark matter halo that 
is a subhalo in some larger host halo, the lensing signal 
will also have a contribution on several hundred kpc to 
1 Mpc scales from the host halo. Usually, this term is 
interpreted statistically in terms of a halo model de- 
scribing the fraction of the lens galaxy sample that are 
satellites, and using the halo mass function to estimate 
the typical lensing signal due to the host halos. By ex- 
cluding satellites from our lens sample, we remove the 
need to model this term, thus simplifying our analysis. 

We model the halo mass distribution as a Navarro- 
Frenk- White (1996; hereafter NFW) profile of cold dark 
matter haloes 



p(r) = 



(r-/rs)(l-f r/rs)2 



(15) 



defined by two parameters, a characteristic density ps 
and scale radius rs, or alternatively, the virial mass 
M200C and concentration C2ooc ~ r2ooc/rs- In this work, 
we adopt the definition of the virial mass as the mass 
enclosed within the virial radius r2ooc within which the 
average density is equal to 200 times the critical density 
of the Universe today pcrit , 



The estimated gas-to-stellar mass ratios are 0.99, 0.41, 
and 0.24, for the three stellar mass bins, respectively, based 
on the best-fitting relation from a weighted fit to the 
189 galaxies in the TFR sample of Rll: log(Mgas/M*) = 
(-0.19 ± 0.02) -I- (-0.60 ± 0.04) (log M. - 10.102). 

Because the lensing signal at a given radius includes con- 
tributions from the mass at all smaller radii, it is not obvious 
a priori that the relative contribution to the lensing signal 
from the baryonic component of the galaxy (with a typical 
scale length of several kpc) will be negligible on the scales 
used in our fits (50-2000 kpc) . Since the baryons are predomi- 
nantly on scales i? < 10 kpc, their contribution to the lensing 
signal on the relevant scales is simply AS^,ar = -'^^bar / ('"'^^ ) • 
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M20OC = -^r200c(200pcrit) 



(16) 



where the subscript denotes that this mass definition 
uses 200pcrit- Compared to the other commonly adopted 
mass definition of M2oob using 200p — 200nmPcrit, the 
virial masses and concentrations using 200pcrit are lower 
by roughly 30 and 60 per cent, respectively (for the 
range of halo masses we study). 

The NFW halo concentration is a weakly decreasing 
function of halo mass with a typical dependence given 
by 



C200c(Af200c) — C200c,0 



M20OC 
1012M( 



(17) 



with I3c ~ 0.1 (Bullock et al. 2001 Neto et al. 2007 



Maccio et al. 20081. In our fiducial fits, we adopt the 



results of A'^-body simulations from Maccio et al. (2008) 
with cosmological parameters from WMAP5 (Komatsu' 
20091. For all (both relaxed and unrelaxcd) haloes 
with 10^° < Mzooc/Mq < 10^^ they found /3c = 
0.110. Following their redshift evolution model for the 
concentration-mass relation, we scale their relation by 
H{z)~^^^ to account for the difference in the effective 
redshifts of our two samples {z = vs. 0.07); this yields 
C200c,0 = 6. 00 They also report a log-normal scatter 
in the halo concentrations at fixed halo mass of 0.130 
dex (this reduces to 0.105 dex when the sample is re- 
stricted to only relaxed haloes). Our modelling of the 
lensing signals does not take into account the scatter 
in the concentration-mass relation at fixed halo mass. 
However, we do not think that this will significantly af- 
fect the results of the fits since we find that they are 
not very sensitive to the overall normalization of the 
concentration-mass relation itself, as shown in Sec. |6.3[ 
The halo-halo contribution to the lensing signal is 
modeled using the galaxy-matter cross-power spectrum 
as in, e.g., Mandelbaum et al. ( 2005b I. It is proportional 



to the bias 6, the ratio of the galaxy-matter correlation 
function to the matter autocorrelation function. We es- 
timate the bias using the fitting formulae from |Seljak fc| 
Warren ( 2004 1 . At the mean redshift of our galaxy sam- 
ple z = 0.07, the growth factor D{z) = 0.9665 and the 
non-linear mass is Mni = lO^^ ^'^^Mo. We find the typ- 
ical bias of galaxies in our sample to be approximately 
0.7-0.9. Note that our removal of satellite galaxies from 
the lens sample may modify the halo-halo term in such 
a way that this model is no longer accurate, but we also 
note that this effect may not be significant because the 
satellite cuts remove only ~ 15 per cent of the sample. 
Reassuringly, we find that our fits are not very sensitive 
to the assumptions that go into the modelling of the 
halo-halo term, as shown in Sec. |6.3| 

In principle, intrinsic alignments of galaxy shapes 
towards the lens ( [Hirata fc Seljak||2004[ ) can mimic a 



15 Using H{z) Ho^nA + J^m(l + z)'-^, we find that the 
evolution in redshift amounts to only a 2 per cent growth 
from z = 0.07 to 0. 



negative lensing signal. This effect can be important if 
there is significant weight in the lensing signal given to 
"sources" that are actually physically associated to the 
lens. In that case, this uncertainty in the theory would 
have to be modeled. As shown in Fig. |14[ the boost 
factors, which indicate what fraction of the sources are 
actually physically associated galaxies at the lens red- 
shift, are within ~ 2 per cent of unity on all scales. This 
finding is unsurprising for a disk galaxy lens popula- 
tion, which should be dominated by field galaxies. It is 
possible that there are very small numbers of source 
galaxies that are close enough to be affected by the 
same tidal field as the lens (i.e., within a few tens of 
Mpc) without causing a boost factor > 1, because we 
expect some galaxies at these distances even for a ran- 
dom distribution. However, the weighting of the sources 
by 1/Ec means that such nearby galaxies receive an ex- 
tremely low weight, in addition to the fact that they 
are already quite rare given the relatively small comov- 
ing volume around zicns compared to that around the 
source median redshift of ~ 0.4. Thus, we expect intrin- 
sic alignments to be completely subdominant compared 
to all other errors described in this work, and we neglect 
them in the modelling. 

6.2 Fiducial fits 

Here, we describe the fits from which we obtain the 
results presented in Sec. [7] First, we individually fit 
to the measured lensing signals for the three stellar 
mass bins, to obtain the best-fitting halo-to-stellar mass 
ratio M2ooc/A^* for each bin (Sec. 6.2.1 1. Second, we 



model the HSMR with four free parameters and si- 
multaneously fit the measured lensing signals for the 
three stellar mass bins to obtain the best-fitting HSMR 
(Sec. 10^. 



6.2.1 Fits to M2ooc/M. 



The model density profiles described in Sec. 6.1 (and 



shown in Fig. 13 1 represent the lensing signal of a single 
halo. We model an individual halo profile as a sum of 
the one-halo and halo-halo contributions, assuming a 
fixed concentration-mass relation given by Eq. |17| On 
the other hand, the measured lensing signal for each 
stellar mass bin has contributions from a large number 
of haloes, with a distribution of halo masses. To take 
this into account, we fit the measured lensing signals 
to the mean profile of an ensemble of individual haloes 
with a distribution of halo masses based on the actual 
distribution of stellar masses in each bin. 

For our fiducial fits, we assume a log-normal scat- 
ter of o"iogM2ooc — 0.1 dex at a fixed M*, and con- 
volve the distribution in logM* by a Gaussian distri- 
bution with this width before the conversion to a dis- 
tribution of halo masses. Typically, this scatter is ex- 
pressed in terms of the log-normal scatter in stellar mass 
at a fixed M200C, ciogM*. Previous analysis of SDSS 
galaxies found that the observed stellar mass function 
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(for all galaxy types) can be adequately fit by assum- 
ing Q-iogM. = 0-15 dex ([Moster et al.||2010[) and 0.175 



dex (Behroozi et al. 20101. We expect the amount of 



scatter to be somewhat smaller for our sample includ- 



ing only disk galaxies. Moreover, aio. 



s M20OC 



is smaller 



than o"iog M, because of the shallow slope of the HSMR 
over the range of stellar masses we study. Therefore, our 
choice of o"ic,gM2ooc =0.1 dex is a conservative one, even 
after accounting for the statistical uncertainty in M« of 
0.04 dex. We consider alternative fits with no scatter 
and with twice the fiducial value in Sec. 16.31 

Given the above modelling assumptions, the pre- 
dicted lensing signal for some stellar mass bin is deter- 
mined by a single parameter, the halo-to-stellar mass 
ratio M2ooc/M,, after one assumes a certain dependence 
of M2ooc/M-t on M«. For the fits to the lensing signals 
for each M« bin, we assume a constant M200c/M, (over 
the stellar mass range of each bin). We use a Levenberg- 
Marquardt minimization routine ( Levenberg|1944 Mar- 
|quardt'1963) to fit for log M2ooc/M. using the measured 
lensing signal from ~ 50-2000 kpc (27 radial bins in all). 
For each stellar mass bin, we perform an independent fit 
to each of the 500 bootstrap resamplings of the 200 sub- 
regions. From the mean and width of the distribution of 
bootstrap parameters, we obtain the best-fitting value 
of M2ooc/M« and its Icr bootstrap error. We take the 
5 per cent uncertainty in the shear calibration into ac- 
count in the bootstrap errors as follows. For each boot- 
strap dataset, we multiply the predicted lensing signal 
by a random number sampled from a Gaussian distribu- 
tion centered at 1 and with a standard deviation of 0.05 
(we note though that the inclusion of this uncertainty 
has a negligible effect on the error in A/200c/Af,). 



6.2.2 Fits to the HSMR 

There is a growing consensus that the halo-to-stellar 
mass ratio of galaxies shows variation with stellar mass, 
with a minimum at M200C ~ 10^^ Mq and M« ~ 5 x 
lO"^Af0 and increasing towards lower and higher masses 
jMandolbaum ct al. 2006c 



Conroy fc Wechslcr 2009 



Moster et al..2010,. Behroozi et al.|2010||Guo et al..2010 



Leauthaud||2012 1. Based on the halo occupation distri 
bution (HOD) modeling of Behroozi et al. (2010 1, the re- 
lation between the halo-to-stellar mass ratio M2ooc/Af, 
and M« (or HSMR) can be modelled as 



log 



A/20( 



log 



Ml 



+ (/3 - 1) log 



+ 



{M./A'L,oy _ 1 
1 + (M./M.,o)"^ 2' 



(18) 



Here, Mi is the characteristic halo mass, M,^o is the 
characteristic stellar mass, /3 is the faint end slope, and 
(5 and 7 control the massive end behavior. |Leauthaud| 
(|2012) adopted the same functional form to constrain 
the HSMR via a joint analysis of galaxy-galaxy lensing, 
galaxy spatial clustering, and galaxy spatial densities of 
galaxies in COSMOS. 



It is unclear what the form of the HSMR is for a 
sample of only late-type galaxies. Moreover, there are 
not many disk galaxies that have stellar masses larger 
than the mass where the HSMR is at its minimum (for 
reference, our galaxy sample rang es fr om. A/, ~ 10^ - 
^^Mq). We choose to adopt Eq. 18 as a fitting func- 



10 

tion for the HSMR, with the understanding that we 
cannot strongly constrain its behavior at high stellar 
masses. Since we are not sufficiently sensitive to S to fit 
for it, we fix 5 = 0.566, following the results of [Leau-] 



thaud| ( [20T2| ) for their lowest redshift bin z = [0.22, 0.48] 
and SIG_mod1 run (c.f. their table 5); they also report 
a la uncertainty in 5 of 0.0860 

We fit for four free parameters — a normalization, 
break, a faint end slope, and a bright end slope. The 
basic set-up is similar to the fits to Af200c/Af* described 
in Sec. |6.2.1| For fits to the HSMR, we simultaneously 
fit the measured lensing signals for galaxies in the three 
stellar mass bins using a Levenberg-Marquardt mini- 
mization routine. In all, there are 27 (radial bins) x 3 
(stellar mass bins) — 81 data points. We perform an in- 
dependent fit to each of the 500 bootstrap resamplings 
of the 200 subregions to obtain the median HSMR as 
well as la and 2a error envelopes. We take into account 
of the 5 per cent uncertainty in the shear calibration 
in the bootstrap errors in the same way as described in 
Sec. |6.2."l| for each bootstrap dataset, we multiply the 
same random number to the lensing signal for all three 
stellar mass bins. 



6.3 Alternative fits 

In this subsection, we summarize the results of our tests 
of the robustness of the results of the fiducial fits (de- 
scribed in Sec. 6.2 1 to various modeling assumptions. In 



each case below, we change a single parameter in the 
fiducial fit and check the effect of this change on the re- 
sults. In all cases, we find that the results are robust to 
reasonable changes in the modelling assumptions, and 
we quote the magnitude of each effect below. 

• Fits without the halo-halo contribution 

In our fiducial fits, the assumed contribution from 
the halo-halo term depends on a theoretically-uncertain 
estimate of the galaxy bias, as well as an assumed func- 
tional form. This may not be an accurate model for our 
lens sample, from which satellite galaxies have been re- 
moved. However, as noted in Sec. 6.1 (and shown in 
Figure 13 1, the halo-halo contribution is negligible be- 
low ~ 1 Mpc and only affects the lensing signal measure- 
ment in the outermost radial bin at ~ 2 Mpc. Moreover, 
the amplitude of the halo-halo term is comparable to the 

We note that changes in 5 only affect the HSMR over a 
small portion of the range of stellar masses we study, M» ~ 4 
- 8 X lO^^AfQ, and even a 5(T change in <5 leads to differences 
well below our measurement uncertainties. Therefore, our re- 
sults are not very sensitive to this modelling assumption. 
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uncertainty in the measurement at that radius, so we do 
not expect our fits to be very sensitive to changes in this 
term. 

To test this explicitly, we perform fits to model 
profiles that do not include the halo-halo contribution 
at all. As expected, these fits yield slightly higher 
best-fitting M2ooc/M,, by 0.02 - 0.03 dex, which is weU 
below the 1-a errors, 0.09 - 0.13 dex. 

• Fits with an alternate concentration-mass relation 



of satellite galaxies from the lens sample (described in 
Sec. 3.2 1, we perform fits to the disk galaxy sample 



before the satellite cut. We do not attempt to model 
the contribution of satellites to the lensing signal, but 
simply restrict the range of the fits to exclude the 
outermost radii, where the satellite contribution domi- 
nates. Using the lensing signal from ~ 50 - 300 kpc (11 
radial bins in all), we find best-fitting M2ooc/Mf that 
are consistent to within la of those from the fiducial 
fits to the sample with the satellite cut appliedrj 



To test the sensitivity of the fits on the adopted 
concentration-mass relatio n. Eg. |17| we perfo rm fits 
with an alternate relation. Mandelbaum et al. (2008a) 



used lensing measurements of SDSS galaxies brighter 
than L» to fit for the relation over halo masses 10^^ < 
Maoob/Mg < lO^'' and found C200b(A^200b) ~ (4.6 ± 
0.7) X (M2oob/(1.56 ± 0.12) x IO^^/i-^Mq)-'"-"*'^-''^' 
at z = 0.22. Converting to our virial mass definition 
M200C and scaling by {1 + z)^^ to our mean redshift 
of 0.07, we find the equivalent parameters for Eg. |17| to 
be C2ooc,o = 5.84 and /3c ~ 0.14. Halo concentrations de- 
rived from this alternate relation are around 10 per cent 
lower than the fiducial one. One would expect the lower 
concentrations to yield systematically higher M2ooc/Af*; 
we indeed find higher values but the difference is tiny, 
< 0.01 dex for the three stellar mass bins. 

We note that Prada et al. (20111 studied halo 



concentrations using a set of state-of-the-art A'^-body 
simulations. They report halo concentrations that are 
~ 10 per cent lower than those of Maccio et al. (2008) 
for Milky Way-size haloes; this difference is of the same 
order as for the alternate relation we tested, so we do 
not expect it to have a significant effect on our fits 
eitherE!] We conclude that our results are not sensitive 
to our particular choice of concentration-mass relation. 

• Fits with alternate values of aiog A/200C 



• Fits to a lens sample with an axis ratio cut 

Recall that the selection of the TFR sample from 
which Vopt is derived includes an axis ratio cut b/a < 
0.6. The lens sample from which 1^200c is derived has not 
been subjected to an axis ratio cut. In this section, we 
show that this difference in selection does not compro- 
mise our results for Vopt/V2ooc. 

We calculate halo-to-stellar mass ratios for the sub- 
sample with b/a < 0.6 and found log(M2ooc/Aft) = 
1.75 ± 0.14, 1.43 ± 0.13, and 1.37 ± 0.20 for each 
stellar mass bin, respectively. For a comparison with 
the full sample that takes into account the correla- 
tion between the two samples, we calculate the ra- 
tio of halo masses for each bootstrap resampling. We 
find that the bootstrap distributions in M^oodb/a < 
0.6 subsample)/M2ooc(full sample) are approximately 
Gaussian, with mean values consistent with unity: 1.37± 
0.36, 1.21 ± 0.25, and 0.96 ± 0.33, for each steUar mass 
bin, respectively, indicating that the application of the 
axis ratio cut does not significantly bias the results pre- 
sented here. We also note that there is considerable 
overlap between the two samples being compared; con- 
seguently, the derived halo masses are correlated at the 
60-70 per cent level, but the bootstrap errors on the ra- 
tios above automatically account for these correlations. 



To convert the distribution in stellar masses in each 
bin into a distribution in halo masses, we assume a 



log-normal scatter of ai, 



OS A/20OC 



= 0.1, at a fixed M,. To 



test the sensitivity of the fits on this choice, we perform 
fits with alternate values criogAf2ooc ~ 0-0 ^^'^ 0-2 dex. 
Assuming zero scatter yields best-fitting M2ooc/M, 
that are higher by 0.01 dex (for all three stellar mass 
bins), while assuming twice the scatter yields values 
lower by 0.02 dex; in both cases, the difference is much 
smaller than the 1-a uncertainty in the measurement 
itself. 

• Fits to a lens sample including satellites 

To test the sensitivity of the results to the removal 

^"^ We note that while the two works agree for Milky Way- 
size haloes, for cluster-size haloes, Prada et al. (2011) re- 
port substantially larger halo concentrations (by up to 50 
per cent) than Maccio et al. (2008). 



7 RESULTS 

In this section, we present our derived constraints on the 
HSMR and OVVR (Sees. [TT] & [7!2|. Then, we discuss 



comparisons of our results with previous work (Sec. 7.3 1 
and with predictions for ACDM haloes (Sec. 7.4 1. 



7.1 Halo-to-stellar mass relation 

Figure [17] shows our constraints on the relation between 
M2ooc/M, and M«. In gualitative agreement with the 
literature, our data show a clear variation in Al2ooc/M, 
with M* and suggest a minimum at M* ~ 5 x IO^^Mq. 



We note that the effect of the inclusion of the satellites 
varies with stellar mass: the sign is zero, positive, and neg- 
ative for the lowest, intermediate, and highest stellar mass 
bins, respectively. Not surprisingly, the sign and amount of 
the difference also depends on the choice of range of radii 
used for the fit. 
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Best-fitting M2ooc/Af* for three stellar mass bins are 
shown as open cirles (and listed in Table |4]| ; the me- 
dian relation and la and 2<j error envelopes are shown 
by the thick solid curve and dark and light grey shaded 
regions, respectively. Recall that the bootstrap error en- 
velopes include the 5 per cent uncertainty in the shear 
calibration. We note that we use results from 471 out 
of the 500 bootstrap datasets (of 200 subregions) that 
resulted in converged fits. 

We provide the derived constraints in tabular form 
in Table [5] for a grid in stellar mass from M« = 10^ - 
W^^ Mq in steps of 0.1 dex. Since each bootstrap dataset 
is fitted by a HSMR with a different set of parameters, 
the median curve and error envelopes do not exactly 
follow the functional form in Eq. |18| To obtain an ana- 
lytical fitting formula, we fit the derived median relation 
to Eq. |18[ for which we find best-fitting parameters: 



log(Mi/M,,o) 
logM,,o 
/3-1 
7 



I. 57 ±0.04 

II. 00 ±0.03 
-0.50 ±0.03 
2.60 ±0.77, 

with S = 0.566E3This relation is shown as the thin solid 
curve in Fig. |17| as shown, it very closely approximates 
the median curve (shown by the thick solid curve) over 
the range of stellar masses we study. For this model and 
the data from the bootstrap mean of the lensing signals 
of the three stellar mass bins, we calculate a value of 
46 for 68 — 4 = 65 degrees of freedom, indicating that 
the data are consistent with this model. 

We calculate Pearson correlation coefficients 
r{Pi,Pj) and covariances using the results from the 
bootstrap datasets, where Po ~ log(Mi/M»,o), -Pi = 
logM.^o, P2 — 13 — 1, and P3 = 7. We find a sig- 
nificant correlation between two pairs of parameters: 
r{Po,P2) = 0.68 and r{Pi,P2) = 0.48, both with p- 
values ^ 1. We also find that Pi and P3 correlate in a 
peculiar manner; their joint distribution does not follow 
an elliptical contour. We note that these covariances are 
properly taken into account in our fits through our use 
of bootstrap analysis. 

The right vertical axis of Fig. [Tt] shows stellar con- 
version efficiencies r;, = (Af,/Af2ooc)/b'^i the percent- 
age of the cosmologically available baryons that end 
up as stars in the galaxy, with /b = ilm/^b = 0.169 



(WMAP7, Komatsu et al. 2011 K Including the con- 
tribution of cold gas, we also estimate "baryon reten- 
tion fractions" Tjb = [(M* ± Mgas)/M2oocl/,7^ For the 
three bins with lensing-weighted average stellar masses 
of 0.62, 2.68, and 6.52 x 10^° Mq, we find 77. =0.151^4. 
-1-0.06 n oQ+0 08 ^ 0.30+°'^° " ny+o'.os 



0.26_o.o5i 0-23_o.o6 



-0.08' "-J'-O.OTi 



0.291q q7, respectively, with la error bars based on the 
uncertainties in M2ooc/M,. Here, we have used gas-to- 
stellar mass ratios of 0.99, 0.41, and 0.24, respectively. 



Allowing <5 to freely vary does not lead to a closer approx- 
imation to the median HSMR, so we present the fits with S 
fixed to its fiducial value. 



Table 4. Constraints on Af200c/A/* from individual fits to 
the lensing signals of galaxies in three stellar mass bins 
(col. 1), stellar conversion efficiencies r;* = (Mt/M200c)/|7^ 
(col. 2), and V'opt/V200c from combination with the calibrated 
TFR (col. 3). Our main results are from the full lens sample 
(top). We also list here results for the subsample with axis ra- 
tio cut q < 0.6 (bottom); we find no systematic bias between 
the two, as described in the final subsection of Sec. 16.31 





log(Af200c/Af*) 


V* 


Vopt/V200c 


Full lens sample 


9.792 
10.428 
10.814 


1.61 ± 0.13 
1.36 ±0.09 
1.42 ± 0.13 




'^•-^^-0.04 
26+° °'' 

r, 90 + 0.08 

'^•^■'-0.06 


1.26 ±0.08 
1.39 ±0.06 

1.27 ± 0.08 


q < 0.6 subsample 


9.772 
10.431 
10.812 


1.75 ± 0.14 
1.43 ±0.13 
1.37 ±0.20 


10+°-°'4 
r, 99+0.08 
'^•^^-0.06 


1.12 ±0.08 
1.31 ± 0.09 
1.31 ± 0.13 



Table 5. Median HSMR and ±1 and 2a error envelopes. 





-2cr 


-Icr 




±la 


+2a 


9.0 


1.27 


1.74 


2.09 


2.33 


2.59 


9.1 


1.28 


1.71 


2.03 


2.26 


2.51 


9.2 


1.29 


1.69 


1.98 


2.18 


2.43 


9.3 


1.28 


1.66 


1.93 


2.11 


2.34 


9.4 


1.28 


1.63 


1.88 


2.05 


2.25 


9.5 


1.28 


1.59 


1.83 


1.98 


2.15 


9.6 


1.29 


1.57 


1.77 


1.92 


2.06 


9.7 


1.30 


1.55 


1.72 


1.85 


1.98 


9.8 


1.31 


1.52 


1.67 


1.79 


1.90 


9.9 


1.30 


1.49 


1.62 


1.72 


1.81 


10.0 


1.28 


1.46 


1.57 


1.66 


1.74 


10.1 


1.28 


1.43 


1.52 


1.60 


1.67 


10.2 


1.28 


1.39 


1.47 


1.55 


1.61 


10.3 


1.23 


1.35 


1.43 


1.50 


1.57 


10.4 


1.17 


1.31 


1.39 


1.46 


1.53 


10.5 


1.14 


1.26 


1.35 


1.43 


1.51 


10.6 


1.06 


1.23 


1.32 


1.41 


1.50 


10.7 


1.02 


1.19 


1.31 


1.41 


1.51 


10.8 


0.96 


1.18 


1.33 


1.44 


1.52 


10.9 


1.02 


1.23 


1.42 


1.55 


1.75 


11.0 


1.15 


1.37 


1.61 


1.92 


2.17 



based on the empirical relation described in Sec. |6.1 
(and quoted in a footnote there). 



7.2 Optical-to-virial velocity relation 

We derive constraints on the relation between 
V;)pt/V2ooc and Af* (or OVVR) as outlined in Sec. [24" 
For each bootstrap dataset, (i) we generate a Vopt vs. 
Af« relation based on a TFR with the zero-point and 
slope randomly sampled (independently) from a Gaus- 
sian distribution centered on the best-fitting values and 
of Gaussian widths equal to the la fit uncertainties in 
those values. (We note that this scatter in the TFR is 
negligible compared to the uncertainty in the HSMR, of 
~ 13 per cent.) Next, (ii) we convert the Af200c/A/, vs. 
Mt, relation into a Af200c vs. Af* relation, and then into 
a V200C vs. M, relation via V200C = (GAf2aoc/'"200c)^''^ 
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logM, (Mq) log M.(Me,) 



Figure 17. Constraints on the HSMR from simultaneous fits 
to the lensing signals for three stellar mass bins. The thick 
solid curve and dark and light grey shaded regions show the 
median relation and its Icr and 2cr error envelopes. Best- 
fitting A^200c/A^* for the three stellar mass bins are shown 
by circles with Ict error bars; horizontal error bars indicate 
bin widths. The thin solid curve (which largely overlaps with 
the thick solid curve) shows the analytical fit to the median 
relation, given by Eq. |18| with best-fitting parameters listed 
in the main text. Results from Mandelbaum et al. (2006) are 
shown by red stars with la error bars. Published HSMRs 
from Dutton et al. (2010) and Leauthaud et al. (2011) are 
shown by the blue dot-dashed and red dotted curves, re- 
spectively, after applying conversions for differences in the 
choices of stellar IMF and virial mass definition. Numbers 
on the right vertical axis indicate stellar conversion efficien- 
cies TJ^ = (M./M200c)/i7^ 



Dividing relations (i) and (ii) yields a OVVR for each 
bootstrap dataset. Finally, we obtain the median rela- 
tion and its la and 2a error envelopes from the boot- 
strap distributions. 

We also obtain K.pt/V200c for each stellar mass bin. 
First, we multiply the best-fitting M2ooc/Mt by the 
lensing-weighted mean A/, to obtain the corresponding 
M200C, convert that into 1^2ooc, and finally, take the ratio 
of that and the lensing-weighted mean Vopt- The results 
are listed in Table |4] 

Figure[l8]shows the V200C vs. M, relation and its la 
and 2a error envelopes (solid curve, dark and light grey 
shaded regions, respectively), together with the Vopt vs. 
Mf relation, Eq.[7| (thick dashed line). Figure [T9| shows 
the median OVVR and its 1-a and 2-a error envelopes 
(solid curve, dark and light grey shaded regions, respec- 
tively). We provide these constraints in tabular form 
(Table [6| for a M, grid from logM^/M© = 9 - 11 in 
steps of 0.1 dex. Numbers on the right vertical axis of 
Fig. [T9| indicate Vopt/V2ooc- 

We constrain the OVVR to around 6 per cent (la) 



Figure 18. Constraints on the V20OC vs. A/» relation are 
shown by the solid curve and dark and light grey shaded 
regions (median, la and 2a error envelopes, respectively). 
Virial velocities V20OC for the three bins are shown by 
circles with la error bars; horizontal error bars indicate bin 
widths. The Vopt vs. M^, relation, Eq. [?[ (derived in Rll) is 
shown as the thick dashed line. For comparison, we also plot 
the V20OC vs. M« relation from the OVVR in Dutton et al. 
(2010) (blue dot-dashed curve) and their V2.2 vs. M, TFR 
(thin blue dashed line). 



and find Viopt/V200c ~ 1-3 for stellar masses 10^ — 10^^. 

1/3 

Recall that since V200C oc Mjqq^., the uncertainty on 
log ( Vopt /V2ooc) is a third of that in logM2ooc. We note 
that the shape of the OVVR is partly dictated by the 
assumed functional form for the HSMR, Eq. |18| al- 
though the best-fitting function suggests a turn-over 
at ~ 3 X IO^'^Mq, the data are consistent with a flat 
Vopt/V2ooc with M*. 



7.3 Comparison with previous work 

Figure [19] shows our constraints on the OVVR together 
with those from previous work that used a similar 
methodology as ours, namely, the combination of halo 
mass measurements (from weak lensing and/or other 
techniques) with TFR measurements from galaxy rota- 
tion curves. 

For late-type L* galaxies from an early SDSS 
dataset, jSeljak (2002) combined weak lensing measure- 
ment s from|Guzik fc Seljak (2002) with an J-band TFR 
from Giovanelli et al. ( 19971 and found Vopt/V2ooc = 1-8 
with a 2a lower limit of 1.4 (shown by the red star and 
error bar in Fig. 19 1; there is no upper limit because 
the halo masses were consistent with zero. This mea- 
surement is only marginally consistent with ours, but 
our result benefits from significant improvements in the 
measurements of both weak lensing and the TFR, due 
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Figure 19. Constraints on the OVVR are shown by the solid 
curve and dark and hght grey shaded regions (median, Icr and 
2a error envelopes, respectively). Vopt/VijoOc for the three M, 
bins are shown by circles with Icr error bars; horizontal error 
bars indicate bin widths. The red star symbol is plotted with 
a 2(T error bar and show Vopt/Viooc derived for galaxies 
by Seljak (2002). The blue dot-dashed and dotted curves 
show the V2.2/V20OC and V80/V20OC vs. M, relations derived 
by Dutton et al. (2010), before and after "correcting" for 
the differences in the TFRs used in that work vs. this one 
(note that the two differ by 0.05 dex). The dark and light 
green shaded regions show the variation in Vniax,h/V200c for 
unmodified pure dark matter NEW haloes, corresponding to 
Itr and 2cr scatter (0.13 and 0.26 dex) in halo concentrations 
C200c(A^200c) (given by Eg. |17| l at a fixed halo mass. 



Table 6. Median OVVR and ±1 and 2(7 error envelopes. 





-2a 


-la 




+ la 


+2a 


9.0 


-0.198 


-0.101 


-0 


022 


0.094 


0.249 


9.1 


-0.174 


-0.081 


-0 


009 


0.099 


0.244 


9.2 


-0.150 


-0.065 





003 


0.101 


0.238 


9.3 


-0.126 


-0.046 





016 


0.106 


0.232 


9.4 


-0.095 


-0.028 





028 


0.108 


0.226 


9.5 


-0.072 


-0.013 





039 


0.115 


0.219 


9.6 


-0.049 


0.003 





051 


0.118 


0.211 


9.7 


-0.026 


0.018 





063 


0.121 


0.203 


9.8 


-0.003 


0.036 





076 


0.124 


0.200 


9.9 


0.021 


0.053 





085 


0.129 


0.193 


10.0 


0.042 


0.067 





098 


0.134 


0.194 


10.1 


0.057 


0.082 





109 


0.139 


0.189 


10.2 


0.071 


0.093 





119 


0.144 


0.187 


10.3 


0.081 


0.104 





129 


0.154 


0.193 


10.4 


0.087 


0.112 





136 


0.163 


0.209 


10.5 


0.088 


0.116 





143 


0.172 


0.212 


10.6 


0.082 


0.116 





147 


0.178 


0.230 


10.7 


0.075 


0.112 





144 


0.186 


0.244 


10.8 


0.064 


0.096 





133 


0.183 


0.251 


10.9 


-0.008 


0.053 





099 


0.160 


0.227 


11.0 


-0.158 


-0.075 





028 


0.111 


0.180 



to larger available datasets (both lens and source galaxy 
samples) and improved analysis methods. 

Dutton et al. (2010, hereafter DIO) combined their 
visual fit to the HSMR for late-type galaxies, modelled 
as a double power law, based on a compilation of mea- 



surements from weak lensing ( Mandelbaum et al. 2006b I 
and satellite kinematics ( [Conroy et al.|2007||More et al.| 
|2011| and Klypin, Prada & Montero-Dorta, in prepara- 
tion) with their fit to the V2.2 vs. A/« TFR based on 
rotation curve data from Pizagno et al. (2007). Their 
central HSMR is shown by the blue dot-dashed curve 
in Figure |17| For comparison, w e also show measure- 
ments from Mandelbaum et al.| ( |2006b[ ) based on late- 
type galaxies in SDSS selected via the frac_deV pa- 
rameter (red stars with la error bars), as well as the 
best-fitting HSMR from Leauthaud et al. (2011), mod- 
elled as Eq. |18[ and derived from combined early- and 
late-type galaxies in COSMOS (red dotted curve). The 
COSMOS result is for gala:x;ies at redshifts 0.2 — 0.5, and 
the rest are for galaxies at a mean redshift of z ~ 0.1. 

Figure [18] compares the V200C vs. M, relation de- 
rived from the HSMR in DIO with ours (blue dot-dashed 
and black solid curves, respectively). As shown, the two 
relations are consistent within ~ 2a over the range of 
stellar masses we study. The figure also compares the 
V2.2 vs. M, TFR used in DIO with the V2.2 vs. M, and 
Vso vs. M, TFRs from Rll. As shown, the V2.2 vs. M, 
TFR used in DIO is lower than that in Rll (blue dashed 
vs. red dotted lines). The difference between the TFRs 
is significant, even for the same definition of Vopt', this 
may be attributed to differences in the galaxy samples 
and analysis methods in the two analyses P°] Compared 
to the Vgo vs. M» TFR in Rll, the relation used in our 
derivation of the OVVR, the V2.2 vs. M. TFR in DIO 
turns out to have a similar slope and a lower normaliza- 
tion by ~ 0.05 dex. 

Figure 19 shows the 1^2.2/V200c vs. M» relation from 



DIO, after converting their V2.2/V200C vs. V2.2 relation 
into a V2.2/V200C vs. M* relation, using their V2.2 vs. 
M« TFR (blue dot-dashed curve). DIO estimates that 
V2.2/V200C — 1 for steUar masses M« = 5 x 10^ - 2 x 
IO^^Mq. For a fairer comparison with our results, we 
calculate the 'Kjpt/V2ooc — V80/V200C relation that one 
would get from DIO ff the Vgo vs. M, TFR from Rll 
is used instead (blue dotted curve); this is simply offset 
by -1-0.05 dex from the former curve This relation is 



There is a large overlap between the TER sample in Rll 
and the Pizagno et al. (2007) sample used in DIO. Out of 
189 galaxies in the Rll sample, 99 galaxies are from the 
Pizagno et al. (2007) sample (those out of the galaxies that 
passed our selection cuts). However, the analysis methods 
for deriving both photometric and kinematic quantities and 
their uncertainties, as well as fits to the TER, are completely 
independent. 

■^^ If one used our V2.2 vs. M, TER to derive the V2.2/V20OC 
vs. A/» relation, the curve will lie between the two blue 
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consistent within 2a of our derived OVVR (black solid 
curve) for the range of stellar masses we study. 



7.4 Comparison with ACDM haloes 

Dark matter haloes in a ACDM cosmology form a re- 
markably tight relation between halo mass and the max- 



imum circular velocity of the halo Knax,h ( Navarro et al. 
1997 1 ^ The question of whether the tightness of this 
relation translates into the tightness of the TFR is a key 
to understanding disk galaxy formation, and therefore, 
the relationship between Vmax,h and Vopt is of crucial 
interest. 

For a NFW halo profile, the ratio Knax,h/V2ooc de- 



pends only on the halo concentration (Navarro et al. 
[1997|: 



V200C 



0.465 



C200C 

^(C200c) 



(19) 



where A{x) = \n{l + x)~ x/ (l + x). The peak of the halo 
velocity curve occurs at a radius rmax.h — 2.163 Vs = 
2.163 r200c/c200c. For the range of halo masses we con- 
sider, C200C ranges from ~ 5 - 8, so rmax.h — 0.3 - 
0.4 r2ooc; the velocity curves of these haloes rise gradu- 
ally out to an appreciable fraction of their virial radii. 

The dark and light green shaded regions in Fig- 
ure 19 show the variation in Vmax,h/V2ooc for unmod- 



ified pure dark matter NFW haloes, corresponding to 
la and 2a scatter (0.13 and 0.26 dex) in halo concen- 
trations C2ooc(Af2ooc) (given by Eq. 171 at a fixed halo 
mass. Note that we have ignored the scatter in the hor- 
izontal direction (in other words, we have used the cen- 
tral HSMR to directly translate a grid of halo masses to 
the corresponding stellar masses). 

When comparing Vmax.h and Vopt (i.e., green vs. 
grey shaded regions in Fig. [l9j showing Vmax,h/V^200c 
and Vopt/Vbooc, respectively), note that only dark mat- 
ter contributes to Vmax,h, while both baryons and dark 
matter contribute to Vopt ■ Also note that rniax,h is larger 
than the optical radius Rso, by a factor of ~ 15; between 
these two radii, the halo circular velocity decreases by 
around 20 to 30 per cent[^ Despite these differences, 
the comparison is instructive: Vopt > Vmax.h indicates 
that the baryons have modified the potential well in the 
optical region of the galaxy, either by their own gravity 
and/or by modifying the structure of the dark matter 
halo (e.g., through adiabatic contraction). 

We find that Vopt ^ V^ax,h over the range of stellar 
masses covered by our sample, M. = 10^ - lO^M©. 
For the three stellar mass bins with lensing-weighted 
stellar masses of 0.62, 2.68, and 6.52 x IO^^Mq, we find 



Kpt/Kiax.h = 1.11±0.06, 1.25±0.05, and 1.16±0.07, re- 
spectively (with la uncertainties propagated from the 
uncertainties in Vopt/V200c, i.e., not including the un- 
certainty corresponding to the scatter in halo concen- 
trations at a given halo mass, among others). Assuming 
an NFW profile for the halo unmodified by the baryons 
yields Fmax.h/Kpt.h = 1.27, 1.30, 1.43. Multiplying this 
by Kpt/K^ax.h gives Vopt/Vopt.h = 1.41 ± 0.08, 1.61 ± 
0.05, and 1.64±0.11, with la uncertainties from the un- 
certainties in K.pt/Vmax,h. In terms of (3-dimensional) 
mass, this corresponds to a dark matter contribution of 
56, 47, and 45 per cent, suggesting that dark matter 
and baryonic contributions are comparable at the opti- 
cal radius. If the dark matter halo undergoes adiabatic 
contraction, this contribution will be even higher. 

It is interesting to compare the expected contribu- 
tion from the different components — dark matter halo, 
stellar and gas disks — against the observed total rota- 
tion velocity at the optical radius Vopt. In particular, the 
sum of the different contributions should not exceed the 
observed velocity; if they do, then one or more assump- 
tions in the modeling must be wrong. For a quick and 
crude comparison, we estimate the contribution of the 
baryons to the rotation velocity using our stellar mass 
estimates (based on Bell et al. 2003 M«/L ratios and 
assuming a fixed Kroupa IMF) and gas mass estimates 
(based on the Kannappan et al 2004 relation between 
gas-to-stellar mass ratio and from u — r colour) . We as- 
sume that the stellar disk scale length is given by the 
mean relation with stellar mass (derived in Rll, eq. 35), 
and that the gas disk has the same scale length]^ As- 
suming, as above, an unmodified NFW halo, we find 

(Kpt.h + Ki'pt.b)''''' /Vopt ~ 1.01, 1.00, 0.98 for the mean 
stellar masses of our three bins, M* — 0.62, 2.68, and 
6.52 X IO^^Mq, respectively. This suggests that the ob- 
servations are consistent with a model in which the ra- 
dial profile of the dark matter halo is close to an un- 
modified NFW profile, i.e., without the effect of adia- 
batic contraction. This result is of course sensitive to 
many other assumptions in the model, including halo 
concentrations, the inner profile of the halo (e.g., NFW 
vs. Einasto), the stellar IMF, stellar and gas mass esti- 
mates, among others. In future work, we will perform a 
careful comparison with a proper accounting of the full 
distributions and uncertainties (Reyes et al. in prep). 



8 SUMMARY AND FUTURE WORK 

In this work, we use measurements of the average halo 
masses M200C of disk galaxies from galaxy-galaxy lens- 
ing to direct constrain the relation between halo-to- 



The tightness of the relation holds for other cosmologies 
as well, but especially for ACDM. 

For an NFW halo, the halo velocity curve is given 
by V^ir) = ViQg^[c/A{c)]lA{x)/x], with x = r/r^. Thus, 
Vopt,h/Knax,h depends only on C200c through the scale length 
rg appearing in the argument x. 



This assumption is not valid for our low M» galaxies for 
which the gas is typically more extended than the stars (e.g., 
Cayatte et~aL]|1994| [Broeils k van Woerdenl|1994| |Rhee fc| 
van Albada|1996||Br(3eils fc Rhee|1997| [Swaters et al.|2002| 
Begum et al.|2008| . The stellar and gas disks will be modelled 
separately in our future analysis. 
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stellar mass ratios M2ooc/M, and stellar mass AI, for 
a large sample of disk galaxies from the SDSS with 
(z) ~ 0.07 and lO'' < M^/Mq < lO". Moreover, we 
combine these measurements with the TuUy-Fisher re- 
lation (TFR), which relates disk rotation velocities at 
the optical radius Vopt and stellar mass A/*, to con- 
strain the relation between optical-to-virial velocity ra- 
tios Vopt/V2ooc and stellar mass Af, . Unlike previous 
measurements of V^pt/Vbooc, we use similarly-selected 
galaxy samples and consistent definitions in both the 
lensing and TFR measurements to enable a fair com- 
bination of the two. In particular, we use the minimal- 
scatter TuUy-Fisher relation from Reyes et al. (2011) 
based on a galaxy sample that is, by construction, a 
fair subsample of the lens sample we use here. 

We model the relation between A^200c/A^, and Af, 
as a functional form based on halo occupation mod- 
elling, Eq. 18 and find that the ratio Af200c/A^* varies 



over the range of stellar masses we study, with a min- 
imum of « 20 at ~ 5 X 10^° Mq. For our three Af» 
bins with lensing-weighted stellar masses of 0.62, 2.68, 
and 6.52 x 10^° Mq, we find A^aooc/A/* = 41, 23, and 
26, respectively (with Icr uncertainties of around 0.1 
dex). These correspond to stellar conversion efficien- 



V* = {M,/M2ooc).f^' = 15 



+5 
4' 



26 



+1, and 23tl 



per 



cent, respectively (assuming a cosmic baryon fraction of 
/b = 0.169). Adding information from the TuUy-Fisher 
relation, we find Kpt/Vaooc = 1.27 ± 0.08, 1.39 ± 0.05, 
and 1.27 ±0.08, respectively. 

We find that the maximum halo circular veloc- 
ity VInax.h J; V^jpt over the range of stellar masses we 
study. For the three stellar mass bins we use, we find 
V;pt/Vmax,h = 1.11 ± 0.06, 1.25 ± 0.05, and 1.16 ± 0.07, 
respectively (with quoted la uncertainties accounting 
solely for the uncertainty in Vopt/Vaooc). Assuming an 
unmodified pure NFW halo profile, we find that the halo 
contribution to the rotation velocity at the optical ra- 
dius is given by Vopt/14pt,h = 1.41 ± 0.08, 1.61 ± 0.05, 
and 1.64±0.11 (again, with quoted la uncertainties ac- 
counting solely for the uncertainty in Vopt/Vaooc). This 
corresponds to a halo contribution in mass of roughly 
half, suggesting that dark matter and baryonic contri- 
butions are comparable at the optical radius. A crude 
accounting of the contribution of the baryons and the 
dark matter halo to the rotation velocity at the optical 
radius suggests that, given the many modelling assump- 
tions made, the data are consistent with a radial halo 
profile that is close to an unmodified NFW halo (i.e., 
with no adiabatic contraction). This result will be re- 
fined after a more detailed analysis, in which the mass 
distribution of the dark matter halo, stars, and gas, 
as well as the adiabatic contraction of the halo, will 
be modelled separately, and the distributions and un- 
certainties in the different parameters will be properly 
taken into account. 

The observational constraints derived in this work 
will serve as input to our models of disk galaxy forma- 
tion. The ultimate goal is to construct models that si- 
multaneously satisfy all the available observational con- 



straints (including those presented here and in Rll). We 
will also investigate the degeneracies between model pa- 
rameters, and identify other observations that may help 
to eventually break them. 
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APPENDIX A: SOURCE CATALOGUE 
GENERATION PROCEDURE 

The generation of the new catalogue begins with the 
explicit selection of what data to use. We first select 
runs and then within them, portions of runs, based on 
the following requirements: 

(i) The files required for all the steps of the catalogue 
reduction procedure exist in their proper format (includ- 
ing psField, fpAtlas, fpObjc, astrometry, and photomet- 
ric calibration). 

(ii) The IMAGE_STATUS fiag must be 4. Any 
higher order bits being set would indicate fundamen- 
tal issues with the data quality (e.g., bad focus). If it is 
set to 2 or 4, this may indicate issues with photometric- 
ity; however, we allow these data to be used provided 
that the photometric calibration procedure itself did not 
raise any fiags. 

(iii) PHOTO_STATUS== 0, indicating that the data 
were able to be processed by the Photo pipeline. 

(iv) The PSP_STATUS flag must be 0, which means 
that the PSF was able to be interpolated across the field 
using the standard second-order quadratics. 

(v) The field must be classified as photometric ac- 
cording to the uber-calibration procedure (Padmanab- 
han et al.|2008| l, i.e., we require CALIB_STATUS==1. 

(vi) The r-band PSF FWHM at the center of the field 
must be < 1.8". 

(vii) The r-band extinction (calculated as 2.751 x 



E(B — V), using the dust maps from Schlegel et al.|1998 
to obtain E{B — V) and the extinction-to-reddening ra- 
tios from |Stoughton et al.|2002l to convert them into the 
r-band extinctions) must be < 0.22 magnitudes at the 
center of the field. (We later require < 0.2 on a per- 
galaxy basis; the less stringent cut on the extinction 
at the field center simply eliminates fields for which no 
galaxies will pass the later cut.) 

Our goal is to select a galaxy sample that has reli- 
able shape measurements using the single-epoch images 
alone; no attempt is made to combine multiple measure- 
ments for the same galaxyP^ However, our initial run 
selection makes no attempt to avoid overlapping areas, 
as we will process all the galaxies and then reconcile 
multiple detections later. 

Beginning from the list of reliable run/camcol/field 
combinations, we then ran a set of scripts to loop 
over those combinations and measure the galaxy shapes 
within a field. In practice, this process ran on anywhere 
from 10--100 processors simultaneously, enabling us to 
process 777 SDSS runs in around 4 weeks. 



In part, this choice is meant to avoid selection biases that 
can arise if the galaxy detection has significantly diff'erent 
resolution or apparent flux in the two runs, as can happen due 
to sky noise fluctuations for galaxies near the detection limit. 
Moreover, it means that while our source number density is 
a function of the imaging conditions, it is not a very strong 
function of the number of runs overlapping a given position. 
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Within a given field, we first obtain all necessary in- 
formation from the Photo v5_6 outputs: astrometry in- 
cluding colour-dependent terms, PSFs, photometric cal- 
ibration including flat-fielding corrections, catalogue of 
selected objects passing the S/N > 5 Photo object de- 
tection threshold, and information needed for the noise 
model (the gain, dark variance, and sky level). We then 
impose some preliminary, loose galaxy selection criteria: 



(i) The object must be classified as a galaxy 
(0BJC_TYPE==3). Photo carries out star/galaxy 
separation by comparing two measures of photometry, 
the cmodel and PSF magnitudes. The cmodel magni- 
tudes are obtained from the best-fitting non-negative 
linear combination of two profiles: the best-fitting de 
Vaucouleurs model and the best-fitting exponential 
model, each determined via separate fits to the object 
light profile. The PSF magnitudes are simply deter- 
mined by fitting the object light profile to a PSF, only 
allowing the amplitude to vary. Objects with psfMag 
— cmodelMag > 0.145 are classified as galaxies. While 
this has been shown ( Mandelbaum et al.|2005a 2008b I 
to be somewhat inaccurate at r > 21, and probabilis- 
tic methods ( [Scranton et al.||2002[ ) based on additional 
criteria do a better job. We find that (a) stars that are 
accidentally classified as galaxies do not end up in our 
shape catalogue because they fail our resolution cut (we 
quantify this statement in Sec. 4.41 applied at a later 



stage of processing, and (b) galaxies that are acciden- 
tally classified as stars would have failed our resolution 
cut the vast majority of the time anyhow. So, the use 
of OBJC-TYPE in this case does not lead to significant 
stellar contamination or loss of useful galaxies. 

(ii) SDSS fields within a given run are defined such 
that they overlap by 24.5" with the next field. We dis- 
card objects located in the overlap region in one of the 
two fields in which they appear, to avoid duplicate de- 
tections. 

(iii) We apply cuts to the r and i band model magni- 
tudes (r < 22, i < 21.6) before correcting for galactic ex- 
tinction. We will later impose additional cuts to obtain a 
catalogue with a flux cut that is constant in extinction- 
corrected magnitudes. The model magnitudes are de- 
fined using the better of an exponential or de Vau- 
couleurs fit to the object light profile in the r band; fits 
in the other bands simply rescale the 2d r-band model, 
allowing for a stable determination of galaxy colours 
needed for photo-z estimation. 

(iv) We require a robust detection of the galaxy at 
S/N 5 in the unbinned data in r and i bands (the 
BINNEDl fiag should be set in r, i, and overall). 

(v) We exclude galaxies that have the following flags 
set: SATURATED, SATURATED.CENTER, EDGE, 
LOCAL_EDGE, MAYBE_CR, MAYBE_EGHOST, 
SUBTRACTED, BRIGHT, TOO_LARGE, BADSKY. 

(vi) We reject galaxies that have the BLENDED flag 
set, unless NODEBLEND is also set. This avoids the 
measurement of shapes for deblended parents and their 
children. Note that the SDSS deblender will set this flag 



for galaxy pairs that are nearby (it does not mean that 
the light profiles actually strongly overlap with that of 
another object). The result is that ~ 15-20 per cent of 
all detected objects in SDSS are deblendedp^ 

(vii) There is a flag cut that depends on the galaxy 
apparent magnitude: those that are fainter than r = 
19.2 are rejected if the INTERP and CR flags are set 
in r or i; those that are brighter are rejected if IN- 
TERP .CENTER is set in r or j. This different treatment 
is done because the brighter galaxies tend to take up 
sufficient area that they might overlap with a bad pixel 
requiring interpolation simply by chance, so we are more 
permissive in allowing for interpolation, as long as it is 
not too close to the object centroid. Note that r < 19.2 
galaxies are 11 per cent of the catalogue, but their rel- 
ative weight in any lensing analysis is low because they 
are quite low redshift, and therefore are either in front 
of many lens samples, or receive a low weight due to the 
small lens-source separation. 

(viii) There is a very preliminary and loose resolution 
cut on the galaxy resolution (where the quantity used 
to impose this cut is defined below). 

For all of the galaxies passing the above cuts, we 
first obtained the full PSF estimate from the PSP 
pipeline (Lupton et al. 2001| |. This PSF estimate is 



obtained via a Karhunen-Loeve (KL) transform, which 
uses a set of bright stars to determine basis functions 
and then to fit their coordinates to spatially varying 
(quadratic) functions. It can be reconstructed for a 
given SDSS run, camcol, field, and filter as a function 
of position on the CCD using the publicly available 
READ_PSF C cod^HUthat reconstructs the basis functions 
and the variation of the coefficients across the field from 
the SDSS psField files. 

Then, we ran the re-Gaussianization PSF correc- 
tion software (Hirata & Seljak 2003 1 on the r and 
i band Atlas images (which are postage stamp im- 
ages with the sky level subtracted and pixels belong- 
ing to other objects masked out, that can be read using 
the READ _ATL AS -IMAGE code that is part of the same 
code package as read_psf). This code measures PSF- 
corrected galaxy ellipticities according to the shape def- 
inition in Eq. ([8|, where ellipticities are derived using 
the "adaptive moments." 

In general, the definition of moments requires per- 
forming sums over the image that are the discrete ap- 
proximation to the following integrals: 



^^^(mcthod) ^ / j^^^ lUmcthod(a3) 



X (a; — xo)i{x — a;o)jda;. 



(Al) 



The adaptive moments are the results of minimizing the 
integral 

2 



E : 



I{x) — A exp 



d^a;(A2) 



^® http : //www . sdss . org/DRT/algorithms/deblend.html 

http : / /www . astro .princet on. edu/"rhl/readAt las Images .tar . gz 
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over the quantities {A, xo, M). This procedure amounts 
to weighting by a weight function w^^'^^'^^\x) corre- 
sponding to the best-fitting elliptical Gaussian that rep- 
resents the image itself, which in practice is determined 
iteratively. Given the moment matrix M, we can define 
ellipticity via 



My 



ei 



62 = 



M^^ + My 



2M. 



+ My 



(A3) 



Given the adaptive moments of the observed galaxy 
images (Mi) and the PSF model interpolated to the 
position of that galaxy (Mp), we can make a simplest 
definition of a resolution factor 



i?2 



, simple 



= 1 - 



Tp 
Ti 



(A4) 



in terms of the traces of those moment matrices. This i?2 
tends to 1 for well-resolved galaxies and for those that 
are completely unresolved. The initial, loose resolution 
factor cut used to select galaxies for shape measurement 
is i?2 > 1/4 in either r or i band; after carrying out the 
shape measurement we will impose a more stringent cut 
to be described below. 

In the case of a Gaussian PSF and galaxy, the PSF- 
correction could be trivially carried out via subtrac- 
tion of the moment matrices. The re-Gaussianization 
method is specifically designed to address both the de- 
viation of the galaxy and the PSF from Gaussianity to 
some order. 

We begin by correcting for the non-Gaussianity of 
the PSF in a way that is exact to first order in PSF non- 
Gaussianity. To do so, the code finds the best-fitting 
Gaussian approximation to the PSF (which is generally 
more extended than a Gaussian), and uses the fit resid- 
ual to construct an image I' of the galaxy as it would 
have appeared with a Gaussian PSFHThen, using the 
re-Gaussianized image, a PSF correction is carried out 
on the galaxy and PSF moments, using the procedure 



from Bernstein & Jarvis ( 2002 1 to correct for the non- 



Gaussianity of the galaxy to first order. In the course of 
carrying out this procedure, we define a new resolution 
factor 



i?2 = 1 - 



Tp 

T,: 



(A5) 



using the moment matrix of the re-Gaussianized galaxy 
image. This resolution factor definition is used for all 
subsequent cuts on resolution factor. 

Finally, we must define the shape measurement er- 
ror (Te per component. To do so, we use a simple estima- 



tor from Bernstein & Jarvis ( 2002 1 which is equivalent 
to CT-y = 2/v (y \s the significance of the detection), 
where ae = 2a-y. In terms of the actual quantities that 
we actually measure, we first define a sky variance as 



2 

0"sky 



sky 2 

: \- O-dark, 

gam 



(A6) 



where the first term results from the Poisson noise due 
to the photons in the sky, and the second is due to 
the dark current (current that builds up due to heat 
even in the absence of photons). The sky level is high 
enough that the noise is effectively Gaussian, and it is 
uncorrelated from pixel to pixel. Moreover, the sky noise 
dominates over the noise from the galaxy flux for the 
large majority of galaxies in the catalogue, for r 20. 
Then, we determine 



(A7) 



47rcrskyCr/ 
FR2 



where (jj = det Mi and F is the total galaxy flux. We 
present tests of these values in Sec. |4.3.2[ 

After all flelds were processed, we ran a reconcili- 
ation procedure to decide between multiple detections, 
and impose our final (more stringent) set of galaxy se- 
lection criteria. To do so, we first eliminated all galax- 
ies at positions with r-band extinction Ar > 0.2. For 
galaxies passing this cut, we collected all detections of 
any single galaxy from all the runs that were processed 
through the pipeline (using a tolerance of 1" to define 
multiple detections). Then, we chose the detection in 
the observation with the smallest PSF FWHM as our 
primary detection of that galaxy. Finally, for the full 
list of galaxies (now using only the primary detection 
of each one) we required that the extinction-corrected 
r-band model magnitude satisfy r < 21.8. 

For each galaxy, we have both r- and i-band shape 
measurements. We combine the measurements in the 
two bands as follows: we define the galaxy S/N in each 
band a as being /a/cr/.a where / is the model fiux. We 
then weight the galaxy shape measurements {ei^a, &2.a) 
by (S/N)'^, and take the weighted average. For the pur- 
pose of our science analyses, we require shape measure- 
ments in both r and i bands, with R2 ^ 1/3 in each. 
Additionally, we require etot = \/ef^Fef < 2 (where 
ei and 62 are the band-averaged ellipticities). This cut 
helps avoid shape measurements that are excessively 
dominated by noise, while at the same time avoiding 
selection biases that can be incurred by imposing the 
apparently more physical cut of etot < 1 (since noise 
can result in observations with etot > 1, which means 
that imposing a cut at that value results in cutting off 
part of the error distribution, biasing the mean). The 
resulting catalogue has 43 378 516 unique galaxy detec- 
tions in 9 493 deg'^. 

The final step was to run the template-based 
Zurich Extragalactic Baycsian Redshift Analyzer (ZE- 
BRA, Feldmann et al. 2006) for all galaxies in the cat- 
alogue. The procedure used in detail is described in 
Nil. In brief, we used a set of templates from |Coleman| 
et al. ( 1980 1 observed across a long wavelength base- 
line in the local universe, supplemented by synthetic 



For more detail, see Hirata & Seljak 1 2003 I 



starburst spectra by |Kinney et al. (1996), and then in- 
terpolated to create a full set of 31 templates in to- 
tal. We ran ZEBRA in the maximum-likelihood (ML) 
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mode, then selected the photo-z based on the peak like- 
hhood marginahzed over template, using a 2 < 1.5 prior 
which is reasonable for single-epoch SDSS photometry. 
We did not use any of the following ZEBRA options: 
photometry self-calibration, template optimisation, or 
Lyman-a IGM absorption. As described in Nil, which 
quantifies the effect of the photo-2 bias and scatter on 
galaxy-galaxy lensing measurements, we imposed addi- 
tional galaxy cuts based on the ZEBRA outputs, re- 
quiring that (a) the resulting photo-z not be one of the 
boundary values (0 or 1.5), and (b) the template not be 
one of the two starburst templates (or an interpolated 
one in that range), since the galaxies that are classified 
as starburst typically have unusually large photo-z er- 
rors (the spectra of star-burst galaxies are sufficiently 
featureless in our range of wavelengths that photo- z; es- 
timation is very difficult). After imposition of those cuts, 
eliminating ~ 10 per cent of the sample, the catalogue 
contains 39 267 029 galaxies. 



APPENDIX B: DIFFERENCES FROM M05 

In terms of area coverage, the catalogue from M05 in- 
cluded imaging data acquired until 2004 June 15 (imag- 
ing run 4682), whereas this catalogue includes all pub- 
licly available imaging data from SDSS. As a conse- 
quence, after all quality cuts were imposed, the resulting 
area increased from 7 002 to 9 493 deg'^. For the science 
results presented in this paper, we use a subset of that 
area covering the portion of the DR7 spectroscopic sam- 
ple area covering our extinction cut, 7 131 deg^. Note 
that the 7 002 deg^ of the original source catalog in- 
cludes some imaging area without spectroscopy; it is 
not strictly a subset of the DR7 spectroscopic sample. 
Use of the original source catalogue would require us to 
eliminate ~ 20 per cent of the DR7 spectroscopic lens 
sample. 

The M05 catalog relied on Photo v5_4 outputs. 



Aihara et al. (20111 detailed the differences between 



v5_4 and v5_6 used for the new catalog; in brief, the pri- 
mary difference that is relevant for our purposes is that 
the new version of Photo has an improved (but not 
fully corrected) sky subtraction algorithm that corrects 
some of the deficiencies in faint galaxy detections near 
bright objects first noted in M05 and subsequently doc- 
umented in SDSS data release papers (DR4, [Adelman^ 
McCarthy et al.]|2006 and others). We discuss the im- 
pact of the residual sky errors for this catalog in Sec. |4.5| 
An additional difference between the catalogs is the 
updated photometric calibration. The old catalog used 
calibrations based on the 0.5-m Photometric Telescope 



(PT, Tucker et al. 20061, which became defunct part- 
way through the survey. As a consequence, the photo- 
metric calibration was wrong for ~ 4 per cent of the 
area of the old catalog, since the PT calibration files 
contained incorrect results due to their not being main- 
tained. This error in 4 per cent of the area, and in gen- 
eral, the worse photometric calibration performance of 



the PT calibrations with respect to ubercalibration, af- 
fected both object selection and also the photo-z perfor- 
mance. In the new catalog, we use the uber-calibration 
procedure ( Padmanabhan et al.|200"8 l, which provides a 
stable photometric calibration at the 1 per cent level for 
griz, and 2 per cent for u, yielding greater uniformity 
in galaxy selection and photo-z. 

The catalogue from M05 employed two methods of 
estimating redshifts for the source galaxies. For those 
galaxies at r < 21 (extinction-corrected model magni- 



tude), the KPHOTOZ (v3_2, Blanton et al. 2003b) was 



used. Unfortunately this code tended to fail for galaxies 
at fainter magnitudes, so for those at r 21, we simply 
utilized a source redshift distribution dN/dz motivated 
by early data from the DEEP2 survey data in the Ex- 
tended Groth strip ( Davis et al.|2003 |. Significant later 
work ( Mandelbaum et al.|2008b I was necessary to quan- 
tify more precisely the biases in the lensing signal due 
to these two separate methods of redshift estimation. 
For the new catalogue, as described in Sec. [A] we have 
utilized a single photo- 2: code to calculate photometric 
redshifts for all sources regardless of their apparent mag- 
nitude, ZEBRA. Extensive tests of the impact of using 
the ZEBRA photo-2; on the calibration of the galaxy- 
galaxy lensing signal in SDSS were presented by Nil. 
The use of a single method of redshift estimation simpli- 
fies the process of using the catalog to calculate lensing 
signals, since we no longer have to calculate them sep- 
arately for r < 21 and r > 21, correct for the different 
biases, and then combine them. 

An additional small difference from the M05 cat- 
alogue results from the correction of two bugs in the 
process of determining the PSF to use for PSF correc- 
tion. The first bug resulted from incorrect usage of the 
READ_PSF software to extract the KL PSF from the ps- 
Field files: in particular, the CCD row and column were 
swappedp^In practice, this amounts to noise in the PSF 
model used for PSF correction, since the PSF does not 
vary systematically when we flip the CCD along the di- 
agonal. The noise in the PSF model becomes noise in 
the estimated shear. 

The second bug was that the PSF returned by 
the READ_PSF software included a "soft bias" of 1000 
counts per pixel, but this soft bias was not subtracted 
off before using the PSF for PSF-correction. While this 
sounds alarming in principle, in fact the PSF model it- 
self has a very high flux normalization (peak flux of 
3 X 10* counts) so this additive constant is not very no- 
ticeable. However, it does make the PSF seem slightly 
more extended than it actually is, an effect that is quan- 
tified in Sec. |4.3[ As shown there, a comparison of shapes 
in the old vs. in the new catalog can demonstrate the 



The SDSS fields are not square, so this bug resulted in the 
code requesting that READ_PSF return PSFs outside of the 
range of row and column that should be included in the field. 
However, READ_PSF simply extrapolated the PSF models off 
the edge of the field without complaint. 
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impact of these bugs, which turns out to be quite minor 
(~ 1-2 per cent). 
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